Overview

Dataset statistics

Number of variables16
Number of observations46059
Missing cells23313
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory52.2 MiB
Average record size in memory1.2 KiB

Variable types

CAT8
NUM6
BOOL2

Warnings

is_retweet has constant value "46059" Constant
user_name has a high cardinality: 25553 distinct values High cardinality
user_location has a high cardinality: 9345 distinct values High cardinality
user_description has a high cardinality: 24494 distinct values High cardinality
user_created has a high cardinality: 25871 distinct values High cardinality
date has a high cardinality: 45622 distinct values High cardinality
text has a high cardinality: 46018 distinct values High cardinality
hashtags has a high cardinality: 16835 distinct values High cardinality
source has a high cardinality: 171 distinct values High cardinality
user_location has 10365 (22.5%) missing values Missing
user_description has 3090 (6.7%) missing values Missing
hashtags has 9816 (21.3%) missing values Missing
user_friends is highly skewed (γ1 = 37.72401569) Skewed
retweets is highly skewed (γ1 = 88.08390544) Skewed
favorites is highly skewed (γ1 = 66.48283436) Skewed
date is uniformly distributed Uniform
text is uniformly distributed Uniform
id has unique values Unique
user_favourites has 671 (1.5%) zeros Zeros
retweets has 30075 (65.3%) zeros Zeros
favorites has 19255 (41.8%) zeros Zeros

Reproduction

Analysis started2021-05-16 01:26:18.473024
Analysis finished2021-05-16 01:26:48.238165
Duration29.77 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

id
Real number (ℝ≥0)

UNIQUE

Distinct46059
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.367027866e+18
Minimum1.337727768e+18
Maximum1.378952133e+18
Zeros0
Zeros (%)0.0%
Memory size360.0 KiB
2021-05-15T20:26:48.427865image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum1.337727768e+18
5-th percentile1.346330701e+18
Q11.363362996e+18
median1.368339972e+18
Q31.373482143e+18
95-th percentile1.37768748e+18
Maximum1.378952133e+18
Range4.122436498e+16
Interquartile range (IQR)1.011914633e+16

Descriptive statistics

Standard deviation9.076527862e+15
Coefficient of variation (CV)0.006639607056
Kurtosis1.402304167
Mean1.367027866e+18
Median Absolute Deviation (MAD)5.077735542e+15
Skewness-1.265038672
Sum5.198936432e+18
Variance8.238335802e+31
MonotocityNot monotonic
2021-05-15T20:26:48.723952image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1.364449897e+181< 0.1%
 
1.377326662e+181< 0.1%
 
1.378623397e+181< 0.1%
 
1.37294314e+181< 0.1%
 
1.344406587e+181< 0.1%
 
1.374362908e+181< 0.1%
 
1.366695895e+181< 0.1%
 
1.362913572e+181< 0.1%
 
1.37190737e+181< 0.1%
 
1.370372644e+181< 0.1%
 
1.365974589e+181< 0.1%
 
1.374436836e+181< 0.1%
 
1.373732669e+181< 0.1%
 
1.366220823e+181< 0.1%
 
1.365974535e+181< 0.1%
 
1.367100394e+181< 0.1%
 
1.36859012e+181< 0.1%
 
1.373767181e+181< 0.1%
 
1.358497844e+181< 0.1%
 
1.366245596e+181< 0.1%
 
1.369686437e+181< 0.1%
 
1.358480395e+181< 0.1%
 
1.370425425e+181< 0.1%
 
1.354223068e+181< 0.1%
 
1.369791286e+181< 0.1%
 
Other values (46034)4603499.9%
 
ValueCountFrequency (%) 
1.337727768e+181< 0.1%
 
1.337728702e+181< 0.1%
 
1.337732077e+181< 0.1%
 
1.337732996e+181< 0.1%
 
1.337733049e+181< 0.1%
 
1.337733857e+181< 0.1%
 
1.337733928e+181< 0.1%
 
1.33773407e+181< 0.1%
 
1.337735596e+181< 0.1%
 
1.337739608e+181< 0.1%
 
ValueCountFrequency (%) 
1.378952133e+181< 0.1%
 
1.37895209e+181< 0.1%
 
1.378949363e+181< 0.1%
 
1.378948927e+181< 0.1%
 
1.378946878e+181< 0.1%
 
1.378946025e+181< 0.1%
 
1.378945931e+181< 0.1%
 
1.378943663e+181< 0.1%
 
1.378941453e+181< 0.1%
 
1.378941317e+181< 0.1%
 

user_name
Categorical

HIGH CARDINALITY

Distinct25553
Distinct (%)55.5%
Missing0
Missing (%)0.0%
Memory size360.0 KiB
Workout Solutions
 
1026
Sputnik
 
284
Xukki🌍
 
219
China Economy
 
184
Sputnik V
 
170
Other values (25548)
44176 
ValueCountFrequency (%) 
Workout Solutions10262.2%
 
Sputnik2840.6%
 
Xukki🌍2190.5%
 
China Economy1840.4%
 
Sputnik V1700.4%
 
ILKHA1350.3%
 
MaryRobotic1320.3%
 
William Owen1260.3%
 
Tradia Inc1210.3%
 
Shen Shiwei沈诗伟1190.3%
 
People's Daily, China1100.2%
 
New Straits Times910.2%
 
Brazil SFE900.2%
 
ChineseEmbassyManila880.2%
 
CGTN790.2%
 
The Peninsula Qatar730.2%
 
CCTV+680.1%
 
People's Daily app650.1%
 
RT580.1%
 
Tibetans560.1%
 
ME550.1%
 
China News 中国新闻网550.1%
 
RiverRising550.1%
 
IANS Tweets550.1%
 
@shalinisharma87530.1%
 
Other values (25528)4249292.3%
 
2021-05-15T20:26:49.259659image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique20054 ?
Unique (%)43.5%
2021-05-15T20:26:49.684166image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length50
Median length13
Mean length14.35502291
Min length1

Overview of Unicode Properties

Unique unicode characters2863
Unique unicode categories23 ?
Unique unicode scripts54 ?
Unique unicode blocks89 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a559918.5%
 
555888.4%
 
e462467.0%
 
i406866.2%
 
n361515.5%
 
r326694.9%
 
o301374.6%
 
s242743.7%
 
t240863.6%
 
l229043.5%
 
h190742.9%
 
u150922.3%
 
d129462.0%
 
m113121.7%
 
S111751.7%
 
c111441.7%
 
y98551.5%
 
M88361.3%
 
k87101.3%
 
A84541.3%
 
g72591.1%
 
C71021.1%
 
T69981.1%
 
D66231.0%
 
R60990.9%
 
Other values (2838)14176721.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter44279467.0%
 
Uppercase Letter11717617.7%
 
Space Separator555948.4%
 
Other Symbol150022.3%
 
Other Punctuation89031.3%
 
Other Letter74121.1%
 
Decimal Number41980.6%
 
Nonspacing Mark28300.4%
 
Format13030.2%
 
Dash Punctuation12380.2%
 
Spacing Mark10270.2%
 
Close Punctuation8010.1%
 
Open Punctuation7790.1%
 
Connector Punctuation6350.1%
 
Math Symbol5680.1%
 
Modifier Symbol3360.1%
 
Final Punctuation169< 0.1%
 
Modifier Letter107< 0.1%
 
Initial Punctuation87< 0.1%
 
Currency Symbol71< 0.1%
 
Private Use52< 0.1%
 
Enclosing Mark48< 0.1%
 
Other Number48< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
S111759.5%
 
M88367.5%
 
A84547.2%
 
C71026.1%
 
T69986.0%
 
D66235.7%
 
R60995.2%
 
N57464.9%
 
P56094.8%
 
B55554.7%
 
E47154.0%
 
I43073.7%
 
H41033.5%
 
K39453.4%
 
L38863.3%
 
J37223.2%
 
G36573.1%
 
W35893.1%
 
F30122.6%
 
V26312.2%
 
O26052.2%
 
U12541.1%
 
Y10210.9%
 
Z5840.5%
 
X5060.4%
 
Other values (307)14421.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a5599112.6%
 
e4624610.4%
 
i406869.2%
 
n361518.2%
 
r326697.4%
 
o301376.8%
 
s242745.5%
 
t240865.4%
 
l229045.2%
 
h190744.3%
 
u150923.4%
 
d129462.9%
 
m113122.6%
 
c111442.5%
 
y98552.2%
 
k87102.0%
 
g72591.6%
 
p56191.3%
 
b53261.2%
 
v50071.1%
 
w49171.1%
 
f33710.8%
 
z22980.5%
 
j19810.4%
 
x10360.2%
 
Other values (452)47031.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
55588> 99.9%
 
 6< 0.1%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
🇺7344.9%
 
🇮7164.8%
 
🇳6964.6%
 
💙5593.7%
 
🇪4893.3%
 
🇸4032.7%
 
😷3932.6%
 
🌈3232.2%
 
🇬3202.1%
 
🇧3162.1%
 
🌍2821.9%
 
🇦2771.8%
 
🏳2561.7%
 
🇨2291.5%
 
🌊2071.4%
 
🇷2051.4%
 
🇰1821.2%
 
🏴1771.2%
 
🇵1681.1%
 
🇭1611.1%
 
1541.0%
 
🇱1531.0%
 
🇲1270.8%
 
🇹1160.8%
 
🇿1130.8%
 
Other values (862)724648.3%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.347139.0%
 
#154217.3%
 
,130914.7%
 
'5215.9%
 
/3944.4%
 
!3423.8%
 
@3363.8%
 
&2653.0%
 
:1471.7%
 
*1371.5%
 
%1141.3%
 
"961.1%
 
570.6%
 
?440.5%
 
270.3%
 
§130.1%
 
\130.1%
 
90.1%
 
90.1%
 
¡90.1%
 
70.1%
 
;70.1%
 
·50.1%
 
4< 0.1%
 
،2< 0.1%
 
Other values (16)230.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
275618.0%
 
171617.1%
 
059814.2%
 
443910.5%
 
73718.8%
 
33157.5%
 
93017.2%
 
52696.4%
 
82536.0%
 
61643.9%
 
2< 0.1%
 
2< 0.1%
 
2< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 
𝟟1< 0.1%
 
𝟝1< 0.1%
 
1< 0.1%
 
1< 0.1%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-122699.0%
 
50.4%
 
40.3%
 
20.2%
 
10.1%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
|29251.4%
 
+12421.8%
 
~7212.7%
 
=315.5%
 
183.2%
 
71.2%
 
40.7%
 
40.7%
 
30.5%
 
20.4%
 
20.4%
 
20.4%
 
20.4%
 
10.2%
 
÷10.2%
 
10.2%
 
10.2%
 
10.2%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_63099.2%
 
50.8%
 

Most frequent Nonspacing Mark characters

ValueCountFrequency (%) 
138749.0%
 
͟2609.2%
 
2127.5%
 
983.5%
 
873.1%
 
722.5%
 
672.4%
 
491.7%
 
341.2%
 
301.1%
 
َ271.0%
 
210.7%
 
ُ190.7%
 
ّ190.7%
 
ି180.6%
 
140.5%
 
ಿ110.4%
 
110.4%
 
ِ100.4%
 
90.3%
 
90.3%
 
ْ80.3%
 
80.3%
 
̶80.3%
 
̴80.3%
 
Other values (134)33411.8%
 

Most frequent Other Letter characters

ValueCountFrequency (%) 
ا4355.9%
 
2633.5%
 
ل2072.8%
 
ن2062.8%
 
م1902.6%
 
ی1732.3%
 
و1602.2%
 
ر1462.0%
 
ي1401.9%
 
1351.8%
 
1291.7%
 
1281.7%
 
1261.7%
 
1231.7%
 
د1211.6%
 
1191.6%
 
1191.6%
 
1191.6%
 
1071.4%
 
س1041.4%
 
ب1031.4%
 
891.2%
 
861.2%
 
761.0%
 
ع731.0%
 
Other values (730)373550.4%
 

Most frequent Modifier Symbol characters

ValueCountFrequency (%) 
🏻14844.0%
 
🏾3711.0%
 
🏼3510.4%
 
🏽319.2%
 
^236.8%
 
¯144.2%
 
`123.6%
 
🏿113.3%
 
92.7%
 
41.2%
 
¸41.2%
 
´41.2%
 
˃20.6%
 
˂20.6%
 

Most frequent Initial Punctuation characters

ValueCountFrequency (%) 
«4248.3%
 
3641.4%
 
910.3%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
8550.3%
 
4224.9%
 
»4224.9%
 

Most frequent Format characters

ValueCountFrequency (%) 
48337.1%
 
󠁧15211.7%
 
󠁢1279.7%
 
󠁿1279.7%
 
󠁳1027.8%
 
󠁣624.8%
 
󠁴624.8%
 
󠁷403.1%
 
󠁬403.1%
 
292.2%
 
󠁥251.9%
 
󠁮251.9%
 
80.6%
 
80.6%
 
70.5%
 
­60.5%
 

Most frequent Enclosing Mark characters

ValueCountFrequency (%) 
3572.9%
 
҉918.8%
 
48.3%
 

Most frequent Currency Symbol characters

ValueCountFrequency (%) 
$3042.3%
 
1216.9%
 
¤811.3%
 
79.9%
 
¥45.6%
 
¢34.2%
 
22.8%
 
£22.8%
 
11.4%
 
11.4%
 
11.4%
 

Most frequent Other Number characters

ValueCountFrequency (%) 
1633.3%
 
1020.8%
 
²714.6%
 
24.2%
 
¹24.2%
 
24.2%
 
³24.2%
 
12.1%
 
12.1%
 
¾12.1%
 
12.1%
 
12.1%
 
12.1%
 
12.1%
 

Most frequent Private Use characters

ValueCountFrequency (%) 
2242.3%
 
1019.2%
 
59.6%
 
59.6%
 
59.6%
 
59.6%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(74395.4%
 
[233.0%
 
{60.8%
 
20.3%
 
10.1%
 
10.1%
 
10.1%
 
10.1%
 
10.1%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)76295.1%
 
]243.0%
 
}60.7%
 
30.4%
 
20.2%
 
10.1%
 
10.1%
 
10.1%
 
10.1%
 

Most frequent Spacing Mark characters

ValueCountFrequency (%) 
28828.0%
 
ि19418.9%
 
11811.5%
 
585.6%
 
ி565.5%
 
484.7%
 
363.5%
 
242.3%
 
161.6%
 
151.5%
 
ਿ151.5%
 
121.2%
 
111.1%
 
ি90.9%
 
70.7%
 
70.7%
 
70.7%
 
70.7%
 
70.7%
 
70.7%
 
70.7%
 
60.6%
 
60.6%
 
50.5%
 
50.5%
 
Other values (27)565.5%
 

Most frequent Modifier Letter characters

ValueCountFrequency (%) 
2018.7%
 
98.4%
 
76.5%
 
76.5%
 
ʸ65.6%
 
65.6%
 
ʷ65.6%
 
54.7%
 
54.7%
 
43.7%
 
43.7%
 
43.7%
 
ʳ43.7%
 
43.7%
 
ـ32.8%
 
ˡ32.8%
 
32.8%
 
ʰ21.9%
 
21.9%
 
ʻ10.9%
 
10.9%
 
ˢ10.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin55609184.1%
 
Common9214613.9%
 
Devanagari30720.5%
 
Arabic26920.4%
 
Inherited25170.4%
 
Han11690.2%
 
Cyrillic7150.1%
 
Tamil6190.1%
 
Greek219< 0.1%
 
Kannada213< 0.1%
 
Katakana180< 0.1%
 
Oriya175< 0.1%
 
Bengali172< 0.1%
 
Gurmukhi150< 0.1%
 
Hebrew123< 0.1%
 
Thai118< 0.1%
 
Telugu108< 0.1%
 
Canadian_Aboriginal75< 0.1%
 
Hangul73< 0.1%
 
Gujarati61< 0.1%
 
Unknown52< 0.1%
 
Armenian44< 0.1%
 
Malayalam43< 0.1%
 
Ethiopic40< 0.1%
 
Hiragana39< 0.1%
 
Other values (29)272< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a5599110.1%
 
e462468.3%
 
i406867.3%
 
n361516.5%
 
r326695.9%
 
o301375.4%
 
s242744.4%
 
t240864.3%
 
l229044.1%
 
h190743.4%
 
u150922.7%
 
d129462.3%
 
m113122.0%
 
S111752.0%
 
c111442.0%
 
y98551.8%
 
M88361.6%
 
k87101.6%
 
A84541.5%
 
g72591.3%
 
C71021.3%
 
T69981.3%
 
D66231.2%
 
R60991.1%
 
N57461.0%
 
Other values (243)8652215.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
5558860.3%
 
.34713.8%
 
#15421.7%
 
,13091.4%
 
-12261.3%
 
)7620.8%
 
27560.8%
 
(7430.8%
 
🇺7340.8%
 
17160.8%
 
🇮7160.8%
 
🇳6960.8%
 
_6300.7%
 
05980.6%
 
💙5590.6%
 
'5210.6%
 
🇪4890.5%
 
44390.5%
 
🇸4030.4%
 
/3940.4%
 
😷3930.4%
 
73710.4%
 
!3420.4%
 
@3360.4%
 
🌈3230.4%
 
Other values (1424)1808919.6%
 

Most frequent Inherited characters

ValueCountFrequency (%) 
138755.1%
 
48319.2%
 
͟26010.3%
 
491.9%
 
351.4%
 
َ271.1%
 
ُ190.8%
 
ّ190.8%
 
ِ100.4%
 
80.3%
 
ْ80.3%
 
̶80.3%
 
̴80.3%
 
̵70.3%
 
̞50.2%
 
̄50.2%
 
͡50.2%
 
͂50.2%
 
͊50.2%
 
͑50.2%
 
̭50.2%
 
̙50.2%
 
́40.2%
 
͜40.2%
 
͛40.2%
 
Other values (68)1375.4%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا43516.2%
 
ل2077.7%
 
ن2067.7%
 
م1907.1%
 
ی1736.4%
 
و1605.9%
 
ر1465.4%
 
ي1405.2%
 
د1214.5%
 
س1043.9%
 
ب1033.8%
 
ع732.7%
 
ح602.2%
 
ج552.0%
 
ش401.5%
 
ز381.4%
 
ف371.4%
 
ه361.3%
 
ت361.3%
 
ٹ351.3%
 
غ341.3%
 
ق301.1%
 
ة291.1%
 
ہ250.9%
 
ص200.7%
 
Other values (26)1595.9%
 

Most frequent Han characters

ValueCountFrequency (%) 
11910.2%
 
11910.2%
 
11910.2%
 
655.6%
 
574.9%
 
564.8%
 
554.7%
 
554.7%
 
232.0%
 
232.0%
 
232.0%
 
121.0%
 
110.9%
 
110.9%
 
110.9%
 
80.7%
 
80.7%
 
80.7%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
60.5%
 
Other values (183)33828.9%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
3014.1%
 
209.4%
 
125.6%
 
125.6%
 
ಿ115.2%
 
115.2%
 
83.8%
 
83.8%
 
73.3%
 
73.3%
 
73.3%
 
62.8%
 
52.3%
 
52.3%
 
52.3%
 
52.3%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
31.4%
 
31.4%
 
31.4%
 
Other values (13)219.9%
 

Most frequent Unknown characters

ValueCountFrequency (%) 
2242.3%
 
1019.2%
 
59.6%
 
59.6%
 
59.6%
 
59.6%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
י1512.2%
 
א129.8%
 
ל108.1%
 
ר108.1%
 
פ86.5%
 
ס86.5%
 
ב86.5%
 
ה64.9%
 
נ64.9%
 
ד64.9%
 
מ54.1%
 
ש43.3%
 
ן43.3%
 
ת43.3%
 
ָ32.4%
 
ִ21.6%
 
ע10.8%
 
ט10.8%
 
צ10.8%
 
ק10.8%
 
ץ10.8%
 
ׁ10.8%
 
ֵ10.8%
 
ֲ10.8%
 
ַ10.8%
 
Other values (3)32.4%
 

Most frequent Ethiopic characters

ValueCountFrequency (%) 
512.5%
 
512.5%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
25.0%
 
12.5%
 
12.5%
 
12.5%
 
12.5%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
2513.9%
 
126.7%
 
116.1%
 
116.1%
 
116.1%
 
116.1%
 
105.6%
 
105.6%
 
105.6%
 
95.0%
 
95.0%
 
73.9%
 
52.8%
 
42.2%
 
42.2%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
10.6%
 
Other values (10)105.6%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
и7810.9%
 
н699.7%
 
т598.3%
 
к476.6%
 
у425.9%
 
С354.9%
 
п344.8%
 
с334.6%
 
о304.2%
 
а253.5%
 
в223.1%
 
м202.8%
 
е182.5%
 
є162.2%
 
л152.1%
 
я142.0%
 
р121.7%
 
А111.5%
 
҉91.3%
 
ь91.3%
 
ѕ91.3%
 
Р81.1%
 
д81.1%
 
К71.0%
 
П60.8%
 
Other values (32)7911.0%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
2889.4%
 
2638.6%
 
2126.9%
 
ि1946.3%
 
1354.4%
 
1294.2%
 
1284.2%
 
1264.1%
 
1234.0%
 
1183.8%
 
1073.5%
 
892.9%
 
872.8%
 
862.8%
 
762.5%
 
722.3%
 
712.3%
 
682.2%
 
672.2%
 
672.2%
 
632.1%
 
481.6%
 
381.2%
 
351.1%
 
341.1%
 
Other values (38)34811.3%
 

Most frequent Thai characters

ValueCountFrequency (%) 
86.8%
 
75.9%
 
75.9%
 
75.9%
 
65.1%
 
54.2%
 
54.2%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
Other values (18)2117.8%
 

Most frequent Greek characters

ValueCountFrequency (%) 
α5022.8%
 
ι188.2%
 
η156.8%
 
σ135.9%
 
Λ104.6%
 
ε83.7%
 
ν83.7%
 
τ73.2%
 
Δ62.7%
 
π52.3%
 
υ52.3%
 
κ52.3%
 
ο52.3%
 
Ξ41.8%
 
ς41.8%
 
Κ41.8%
 
ω41.8%
 
Β31.4%
 
Ο31.4%
 
β20.9%
 
λ20.9%
 
Ι20.9%
 
Γ20.9%
 
Ε20.9%
 
Ν20.9%
 
Other values (23)3013.7%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
615.4%
 
37.7%
 
37.7%
 
37.7%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 

Most frequent Canadian_Aboriginal characters

ValueCountFrequency (%) 
1013.3%
 
810.7%
 
68.0%
 
68.0%
 
68.0%
 
56.7%
 
56.7%
 
56.7%
 
34.0%
 
34.0%
 
34.0%
 
22.7%
 
22.7%
 
22.7%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
ղ1431.8%
 
օ511.4%
 
ց49.1%
 
հ36.8%
 
ֆ36.8%
 
յ24.5%
 
ք24.5%
 
Ե24.5%
 
Տ24.5%
 
Յ24.5%
 
ժ12.3%
 
Շ12.3%
 
ա12.3%
 
ե12.3%
 
Թ12.3%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
821.6%
 
821.6%
 
410.8%
 
410.8%
 
410.8%
 
410.8%
 
25.4%
 
Ϣ12.7%
 
Ϯ12.7%
 
12.7%
 

Most frequent Braille characters

ValueCountFrequency (%) 
327.3%
 
218.2%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
1510.0%
 
ਿ1510.0%
 
106.7%
 
96.0%
 
96.0%
 
96.0%
 
74.7%
 
74.7%
 
74.7%
 
74.7%
 
64.0%
 
53.3%
 
42.7%
 
42.7%
 
42.7%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
21.3%
 
21.3%
 
21.3%
 
Other values (5)53.3%
 

Most frequent Linear_B characters

ValueCountFrequency (%) 
𐂂1100.0%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
2414.0%
 
116.4%
 
105.8%
 
105.8%
 
ি95.2%
 
74.1%
 
74.1%
 
74.1%
 
63.5%
 
63.5%
 
52.9%
 
52.9%
 
52.9%
 
52.9%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
21.2%
 
21.2%
 
Other values (16)2011.6%
 

Most frequent Egyptian_Hieroglyphs characters

ValueCountFrequency (%) 
𓃬216.7%
 
𓆉216.7%
 
𓆩18.3%
 
𓆪18.3%
 
𓊈18.3%
 
𓊉18.3%
 
𓆏18.3%
 
𓅪18.3%
 
𓂆18.3%
 
𓆌18.3%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
45.5%
 
45.5%
 
45.5%
 
45.5%
 
45.5%
 
22.7%
 
22.7%
 
22.7%
 
22.7%
 
22.7%
 
22.7%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
Other values (27)2737.0%
 

Most frequent Lao characters

ValueCountFrequency (%) 
233.3%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
98.3%
 
98.3%
 
87.4%
 
76.5%
 
54.6%
 
ి54.6%
 
54.6%
 
54.6%
 
54.6%
 
54.6%
 
43.7%
 
43.7%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
Other values (6)65.6%
 

Most frequent Malayalam characters

ValueCountFrequency (%) 
614.0%
 
49.3%
 
49.3%
 
37.0%
 
37.0%
 
37.0%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 

Most frequent Tibetan characters

ValueCountFrequency (%) 
114.3%
 
114.3%
 
114.3%
 
114.3%
 
114.3%
 
114.3%
 
114.3%
 

Most frequent Bamum characters

ValueCountFrequency (%) 
𖤐2100.0%
 

Most frequent Tifinagh characters

ValueCountFrequency (%) 
624.0%
 
416.0%
 
416.0%
 
312.0%
 
28.0%
 
28.0%
 
28.0%
 
28.0%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
1550.0%
 
413.3%
 
26.7%
 
26.7%
 
26.7%
 
13.3%
 
13.3%
 
13.3%
 
13.3%
 
13.3%
 

Most frequent Tagbanwa characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
9815.8%
 
589.4%
 
ி569.0%
 
426.8%
 
396.3%
 
365.8%
 
365.8%
 
325.2%
 
284.5%
 
233.7%
 
213.4%
 
193.1%
 
193.1%
 
142.3%
 
121.9%
 
111.8%
 
101.6%
 
81.3%
 
71.1%
 
71.1%
 
61.0%
 
50.8%
 
50.8%
 
40.6%
 
30.5%
 
Other values (10)203.2%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
58.2%
 
58.2%
 
58.2%
 
34.9%
 
34.9%
 
34.9%
 
34.9%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
11.6%
 
11.6%
 
11.6%
 
િ11.6%
 
11.6%
 
11.6%
 
Other values (4)46.6%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
2112.0%
 
ି1810.3%
 
126.9%
 
116.3%
 
95.1%
 
95.1%
 
84.6%
 
84.6%
 
74.0%
 
74.0%
 
63.4%
 
63.4%
 
52.9%
 
52.9%
 
52.9%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
10.6%
 
Other values (9)95.1%
 

Most frequent Sharada characters

ValueCountFrequency (%) 
𑆳222.2%
 
𑆮111.1%
 
𑆴111.1%
 
𑆑111.1%
 
𑆱111.1%
 
𑆫111.1%
 
𑆽111.1%
 
𑆤111.1%
 

Most frequent Kayah_Li characters

ValueCountFrequency (%) 
333.3%
 
222.2%
 
111.1%
 
111.1%
 
111.1%
 
111.1%
 

Most frequent Javanese characters

ValueCountFrequency (%) 
266.7%
 
133.3%
 

Most frequent Balinese characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
342.9%
 
228.6%
 
114.3%
 
114.3%
 

Most frequent Khmer characters

ValueCountFrequency (%) 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 

Most frequent Cuneiform characters

ValueCountFrequency (%) 
𒇷133.3%
 
𒁯133.3%
 
𒅗133.3%
 

Most frequent Tai_Tham characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Old_South_Arabian characters

ValueCountFrequency (%) 
𐩱325.0%
 
𐩬216.7%
 
𐩡216.7%
 
𐩴18.3%
 
𐩤18.3%
 
𐩢18.3%
 
𐩷18.3%
 
𐩺18.3%
 

Most frequent Bopomofo characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Runic characters

ValueCountFrequency (%) 
1028.6%
 
1028.6%
 
514.3%
 
514.3%
 
514.3%
 

Most frequent Thaana characters

ValueCountFrequency (%) 
ވ116.7%
 
ަ116.7%
 
އ116.7%
 
ް116.7%
 
ޑ116.7%
 
ެ116.7%
 

Most frequent Limbu characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Tai_Viet characters

ValueCountFrequency (%) 
360.0%
 
120.0%
 
120.0%
 

Most frequent New_Tai_Lue characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Tai_Le characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Tagalog characters

ValueCountFrequency (%) 
220.0%
 
220.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 

Most frequent Yi characters

ValueCountFrequency (%) 
1100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII62709094.8%
 
None70571.1%
 
Enclosed Alphanum Sup59640.9%
 
Devanagari30750.5%
 
Math Alphanum28790.4%
 
Arabic27760.4%
 
VS14360.2%
 
CJK11690.2%
 
Misc Symbols9840.1%
 
Latin 1 Sup9570.1%
 
Punctuation7910.1%
 
Tags7620.1%
 
Cyrillic7150.1%
 
Emoticons6530.1%
 
Tamil6190.1%
 
Dingbats5750.1%
 
Diacriticals4590.1%
 
Phonetic Ext263< 0.1%
 
IPA Ext245< 0.1%
 
Letterlike Symbols215< 0.1%
 
Kannada213< 0.1%
 
Latin Ext A193< 0.1%
 
Katakana184< 0.1%
 
Oriya175< 0.1%
 
Bengali172< 0.1%
 
Other values (64)15570.2%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a559918.9%
 
555888.9%
 
e462467.4%
 
i406866.5%
 
n361515.8%
 
r326695.2%
 
o301374.8%
 
s242743.9%
 
t240863.8%
 
l229043.7%
 
h190743.0%
 
u150922.4%
 
d129462.1%
 
m113121.8%
 
S111751.8%
 
c111441.8%
 
y98551.6%
 
M88361.4%
 
k87101.4%
 
A84541.3%
 
g72591.2%
 
C71021.1%
 
T69981.1%
 
D66231.1%
 
R60991.0%
 
Other values (68)10767917.2%
 

Most frequent Enclosed Alphanum Sup characters

ValueCountFrequency (%) 
🇺73412.3%
 
🇮71612.0%
 
🇳69611.7%
 
🇪4898.2%
 
🇸4036.8%
 
🇬3205.4%
 
🇧3165.3%
 
🇦2774.6%
 
🇨2293.8%
 
🇷2053.4%
 
🇰1823.1%
 
🇵1682.8%
 
🇭1612.7%
 
🇱1532.6%
 
🇲1272.1%
 
🇹1161.9%
 
🇿1131.9%
 
🇼721.2%
 
🇩671.1%
 
🇫560.9%
 
🇾380.6%
 
🇯300.5%
 
🇴280.5%
 
🅰270.5%
 
🇽160.3%
 
Other values (55)2253.8%
 

Most frequent None characters

ValueCountFrequency (%) 
💙5597.9%
 
🌈3234.6%
 
🌍2824.0%
 
🏳2563.6%
 
🌊2072.9%
 
🏴1772.5%
 
🏻1482.1%
 
🌹1021.4%
 
💛961.4%
 
💉931.3%
 
💜921.3%
 
💚901.3%
 
🌎891.3%
 
🕷871.2%
 
🕊681.0%
 
🧡580.8%
 
🌐550.8%
 
🚩530.8%
 
🌱520.7%
 
🦋520.7%
 
α500.7%
 
🌺470.7%
 
🐝460.7%
 
👑450.6%
 
🌻450.6%
 
Other values (633)388555.1%
 

Most frequent Dingbats characters

ValueCountFrequency (%) 
11219.5%
 
7913.7%
 
569.7%
 
539.2%
 
417.1%
 
264.5%
 
213.7%
 
213.7%
 
162.8%
 
162.8%
 
162.8%
 
111.9%
 
111.9%
 
111.9%
 
101.7%
 
91.6%
 
91.6%
 
71.2%
 
50.9%
 
50.9%
 
40.7%
 
20.3%
 
20.3%
 
20.3%
 
20.3%
 
Other values (19)284.9%
 

Most frequent VS characters

ValueCountFrequency (%) 
138796.6%
 
493.4%
 

Most frequent Misc Symbols characters

ValueCountFrequency (%) 
656.6%
 
545.5%
 
515.2%
 
495.0%
 
484.9%
 
424.3%
 
373.8%
 
353.6%
 
333.4%
 
323.3%
 
303.0%
 
262.6%
 
252.5%
 
232.3%
 
212.1%
 
202.0%
 
191.9%
 
171.7%
 
171.7%
 
161.6%
 
121.2%
 
121.2%
 
121.2%
 
121.2%
 
121.2%
 
Other values (71)26426.8%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا43515.7%
 
ل2077.5%
 
ن2067.4%
 
م1906.8%
 
ی1736.2%
 
و1605.8%
 
ر1465.3%
 
ي1405.0%
 
د1214.4%
 
س1043.7%
 
ب1033.7%
 
ع732.6%
 
ح602.2%
 
ج552.0%
 
ش401.4%
 
ز381.4%
 
ف371.3%
 
ه361.3%
 
ت361.3%
 
ٹ351.3%
 
غ341.2%
 
ق301.1%
 
ة291.0%
 
َ271.0%
 
ہ250.9%
 
Other values (35)2368.5%
 

Most frequent Math Alphanum characters

ValueCountFrequency (%) 
𝐚742.6%
 
𝓪471.6%
 
𝖆471.6%
 
𝕚441.5%
 
𝕝441.5%
 
𝕟431.5%
 
𝓮421.5%
 
𝓲391.4%
 
𝖓391.4%
 
𝕖381.3%
 
𝕒371.3%
 
𝓵351.2%
 
𝐡331.1%
 
𝕙321.1%
 
𝖔311.1%
 
𝐧311.1%
 
𝖊301.0%
 
𝗲270.9%
 
𝖗260.9%
 
𝕠260.9%
 
𝐢250.9%
 
𝖙250.9%
 
𝚎240.8%
 
𝕤240.8%
 
𝗮230.8%
 
Other values (377)199369.2%
 

Most frequent Emoticons characters

ValueCountFrequency (%) 
😷39360.2%
 
🙏548.3%
 
🙂294.4%
 
😎253.8%
 
🙈152.3%
 
🙉111.7%
 
🙊111.7%
 
😍111.7%
 
😉111.7%
 
😡81.2%
 
🙌71.1%
 
😈71.1%
 
😆71.1%
 
😃50.8%
 
😊40.6%
 
🙃40.6%
 
😺40.6%
 
😇40.6%
 
😻40.6%
 
😀30.5%
 
🙋30.5%
 
😁30.5%
 
😬30.5%
 
😼20.3%
 
🙅20.3%
 
Other values (17)233.5%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
48361.1%
 
8510.7%
 
577.2%
 
425.3%
 
364.6%
 
293.7%
 
91.1%
 
81.0%
 
81.0%
 
70.9%
 
70.9%
 
50.6%
 
50.6%
 
40.5%
 
20.3%
 
20.3%
 
10.1%
 
10.1%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
é12813.4%
 
á12012.5%
 
í727.5%
 
®687.1%
 
«424.4%
 
»424.4%
 
ñ394.1%
 
ö353.7%
 
ó323.3%
 
©282.9%
 
ü272.8%
 
°242.5%
 
ä181.9%
 
¯141.5%
 
§131.4%
 
ò131.4%
 
Ó121.3%
 
ë111.1%
 
è111.1%
 
ç90.9%
 
¡90.9%
 
ú90.9%
 
ø90.9%
 
É80.8%
 
¤80.8%
 
Other values (49)15616.3%
 

Most frequent CJK characters

ValueCountFrequency (%) 
11910.2%
 
11910.2%
 
11910.2%
 
655.6%
 
574.9%
 
564.8%
 
554.7%
 
554.7%
 
232.0%
 
232.0%
 
232.0%
 
121.0%
 
110.9%
 
110.9%
 
110.9%
 
80.7%
 
80.7%
 
80.7%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
70.6%
 
60.5%
 
Other values (183)33828.9%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
3014.1%
 
209.4%
 
125.6%
 
125.6%
 
ಿ115.2%
 
115.2%
 
83.8%
 
83.8%
 
73.3%
 
73.3%
 
73.3%
 
62.8%
 
52.3%
 
52.3%
 
52.3%
 
52.3%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
41.9%
 
31.4%
 
31.4%
 
31.4%
 
Other values (13)219.9%
 

Most frequent PUA characters

ValueCountFrequency (%) 
2242.3%
 
1019.2%
 
59.6%
 
59.6%
 
59.6%
 
59.6%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
י1512.2%
 
א129.8%
 
ל108.1%
 
ר108.1%
 
פ86.5%
 
ס86.5%
 
ב86.5%
 
ה64.9%
 
נ64.9%
 
ד64.9%
 
מ54.1%
 
ש43.3%
 
ן43.3%
 
ת43.3%
 
ָ32.4%
 
ִ21.6%
 
ע10.8%
 
ט10.8%
 
צ10.8%
 
ק10.8%
 
ץ10.8%
 
ׁ10.8%
 
ֵ10.8%
 
ֲ10.8%
 
ַ10.8%
 
Other values (3)32.4%
 

Most frequent Ethiopic characters

ValueCountFrequency (%) 
512.5%
 
512.5%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
410.0%
 
25.0%
 
12.5%
 
12.5%
 
12.5%
 
12.5%
 

Most frequent Modifier Tone Letters characters

ValueCountFrequency (%) 
4100.0%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
2513.6%
 
126.5%
 
116.0%
 
116.0%
 
116.0%
 
116.0%
 
105.4%
 
105.4%
 
105.4%
 
94.9%
 
94.9%
 
73.8%
 
52.7%
 
42.2%
 
42.2%
 
42.2%
 
31.6%
 
31.6%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
Other values (11)116.0%
 

Most frequent Latin Ext A characters

ValueCountFrequency (%) 
ć2412.4%
 
Ć147.3%
 
ı136.7%
 
č126.2%
 
İ115.7%
 
ā115.7%
 
ř105.2%
 
ē94.7%
 
ş94.7%
 
ł94.7%
 
Š84.1%
 
Ş63.1%
 
š63.1%
 
Č52.6%
 
ğ42.1%
 
ń31.6%
 
ū31.6%
 
Ł31.6%
 
Ţ21.0%
 
Ż21.0%
 
Ī21.0%
 
ď10.5%
 
Ą10.5%
 
Ğ10.5%
 
ō10.5%
 
Other values (23)2311.9%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
и7810.9%
 
н699.7%
 
т598.3%
 
к476.6%
 
у425.9%
 
С354.9%
 
п344.8%
 
с334.6%
 
о304.2%
 
а253.5%
 
в223.1%
 
м202.8%
 
е182.5%
 
є162.2%
 
л152.1%
 
я142.0%
 
р121.7%
 
А111.5%
 
҉91.3%
 
ь91.3%
 
ѕ91.3%
 
Р81.1%
 
д81.1%
 
К71.0%
 
П60.8%
 
Other values (32)7911.0%
 

Most frequent Letterlike Symbols characters

ValueCountFrequency (%) 
15471.6%
 
188.4%
 
157.0%
 
52.3%
 
41.9%
 
20.9%
 
20.9%
 
20.9%
 
20.9%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 
10.5%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
2889.4%
 
2638.6%
 
2126.9%
 
ि1946.3%
 
1354.4%
 
1294.2%
 
1284.2%
 
1264.1%
 
1234.0%
 
1183.8%
 
1073.5%
 
892.9%
 
872.8%
 
862.8%
 
762.5%
 
722.3%
 
712.3%
 
682.2%
 
672.2%
 
672.2%
 
632.0%
 
481.6%
 
381.2%
 
351.1%
 
341.1%
 
Other values (40)35111.4%
 

Most frequent Tags characters

ValueCountFrequency (%) 
󠁧15219.9%
 
󠁢12716.7%
 
󠁿12716.7%
 
󠁳10213.4%
 
󠁣628.1%
 
󠁴628.1%
 
󠁷405.2%
 
󠁬405.2%
 
󠁥253.3%
 
󠁮253.3%
 

Most frequent Latin Ext Additional characters

ValueCountFrequency (%) 
426.7%
 
213.3%
 
213.3%
 
213.3%
 
213.3%
 
16.7%
 
16.7%
 
16.7%
 

Most frequent Thai characters

ValueCountFrequency (%) 
86.8%
 
75.9%
 
75.9%
 
75.9%
 
65.1%
 
54.2%
 
54.2%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
32.5%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
Other values (18)2117.8%
 

Most frequent Arabic PF A characters

ValueCountFrequency (%) 
9100.0%
 

Most frequent IPA Ext characters

ValueCountFrequency (%) 
ɴ3313.5%
 
ʀ3012.2%
 
ɪ239.4%
 
ʟ197.8%
 
ɑ176.9%
 
ɾ156.1%
 
ʏ124.9%
 
ʙ93.7%
 
ʜ83.3%
 
ɔ72.9%
 
ɢ62.4%
 
ʍ62.4%
 
ʇ52.0%
 
ɛ52.0%
 
ɨ41.6%
 
ɐ41.6%
 
ɱ41.6%
 
ɥ41.6%
 
ʕ41.6%
 
ʔ41.6%
 
ʅ41.6%
 
ɯ31.2%
 
ʎ31.2%
 
ɹ31.2%
 
ʞ20.8%
 
Other values (8)114.5%
 

Most frequent Phonetic Ext characters

ValueCountFrequency (%) 
5721.7%
 
2911.0%
 
207.6%
 
197.2%
 
176.5%
 
176.5%
 
134.9%
 
103.8%
 
72.7%
 
72.7%
 
72.7%
 
62.3%
 
62.3%
 
51.9%
 
51.9%
 
41.5%
 
41.5%
 
41.5%
 
41.5%
 
41.5%
 
41.5%
 
31.1%
 
31.1%
 
20.8%
 
20.8%
 
Other values (4)41.5%
 

Most frequent Arrows characters

ValueCountFrequency (%) 
675.0%
 
112.5%
 
112.5%
 

Most frequent Diacriticals characters

ValueCountFrequency (%) 
͟26056.6%
 
̶81.7%
 
̴81.7%
 
̵71.5%
 
̞51.1%
 
̄51.1%
 
͡51.1%
 
͂51.1%
 
͊51.1%
 
͑51.1%
 
̭51.1%
 
̙51.1%
 
́40.9%
 
͜40.9%
 
͛40.9%
 
ͪ40.9%
 
̟40.9%
 
͙40.9%
 
̦30.7%
 
̜30.7%
 
̒30.7%
 
̯30.7%
 
̸30.7%
 
̃30.7%
 
̎30.7%
 
Other values (52)9119.8%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
615.4%
 
37.7%
 
37.7%
 
37.7%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
25.1%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 
12.6%
 

Most frequent Misc Technical characters

ValueCountFrequency (%) 
936.0%
 
728.0%
 
520.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 

Most frequent UCAS characters

ValueCountFrequency (%) 
1013.3%
 
810.7%
 
68.0%
 
68.0%
 
68.0%
 
56.7%
 
56.7%
 
56.7%
 
34.0%
 
34.0%
 
34.0%
 
22.7%
 
22.7%
 
22.7%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 
11.3%
 

Most frequent Enclosed Alphanum characters

ValueCountFrequency (%) 
2625.0%
 
2221.2%
 
54.8%
 
54.8%
 
43.8%
 
43.8%
 
32.9%
 
32.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
Other values (6)65.8%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
ղ1431.8%
 
օ511.4%
 
ց49.1%
 
հ36.8%
 
ֆ36.8%
 
յ24.5%
 
ք24.5%
 
Ե24.5%
 
Տ24.5%
 
Յ24.5%
 
ժ12.3%
 
Շ12.3%
 
ա12.3%
 
ե12.3%
 
Թ12.3%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
822.9%
 
822.9%
 
411.4%
 
411.4%
 
411.4%
 
411.4%
 
25.7%
 
12.9%
 

Most frequent Braille characters

ValueCountFrequency (%) 
327.3%
 
218.2%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
1510.0%
 
ਿ1510.0%
 
106.7%
 
96.0%
 
96.0%
 
96.0%
 
74.7%
 
74.7%
 
74.7%
 
74.7%
 
64.0%
 
53.3%
 
42.7%
 
42.7%
 
42.7%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
32.0%
 
21.3%
 
21.3%
 
21.3%
 
Other values (5)53.3%
 

Most frequent Math Operators characters

ValueCountFrequency (%) 
1845.0%
 
717.5%
 
410.0%
 
410.0%
 
25.0%
 
25.0%
 
12.5%
 
12.5%
 
12.5%
 

Most frequent Currency Symbols characters

ValueCountFrequency (%) 
1252.2%
 
730.4%
 
28.7%
 
14.3%
 
14.3%
 

Most frequent Latin Ext B characters

ValueCountFrequency (%) 
ǟ516.7%
 
ƃ413.3%
 
ȝ413.3%
 
ǝ310.0%
 
ƛ310.0%
 
Ƹ26.7%
 
Ʒ13.3%
 
Ƭ13.3%
 
Ɩ13.3%
 
ȶ13.3%
 
ƈ13.3%
 
Ⱥ13.3%
 
ƫ13.3%
 
Ɔ13.3%
 
Ɓ13.3%
 

Most frequent Linear B Ideograms characters

ValueCountFrequency (%) 
𐂂1100.0%
 

Most frequent Modifier Letters characters

ValueCountFrequency (%) 
ʸ622.2%
 
ʷ622.2%
 
ʳ414.8%
 
ˡ311.1%
 
ʰ27.4%
 
˃27.4%
 
˂27.4%
 
ʻ13.7%
 
ˢ13.7%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
2414.0%
 
116.4%
 
105.8%
 
105.8%
 
ি95.2%
 
74.1%
 
74.1%
 
74.1%
 
63.5%
 
63.5%
 
52.9%
 
52.9%
 
52.9%
 
52.9%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
21.2%
 
21.2%
 
Other values (16)2011.6%
 

Most frequent Latin Ext C characters

ValueCountFrequency (%) 
250.0%
 
125.0%
 
125.0%
 

Most frequent Egyptian Hieroglyphs characters

ValueCountFrequency (%) 
𓃬216.7%
 
𓆉216.7%
 
𓆩18.3%
 
𓆪18.3%
 
𓊈18.3%
 
𓊉18.3%
 
𓆏18.3%
 
𓅪18.3%
 
𓂆18.3%
 
𓆌18.3%
 

Most frequent Playing Cards characters

ValueCountFrequency (%) 
🃏3100.0%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
45.7%
 
45.7%
 
45.7%
 
45.7%
 
45.7%
 
22.9%
 
22.9%
 
22.9%
 
22.9%
 
22.9%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
11.4%
 
Other values (25)2535.7%
 

Most frequent Lao characters

ValueCountFrequency (%) 
233.3%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
98.3%
 
98.3%
 
87.4%
 
76.5%
 
54.6%
 
ి54.6%
 
54.6%
 
54.6%
 
54.6%
 
54.6%
 
43.7%
 
43.7%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
Other values (6)65.6%
 

Most frequent Malayalam characters

ValueCountFrequency (%) 
614.0%
 
49.3%
 
49.3%
 
37.0%
 
37.0%
 
37.0%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
24.7%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 
12.3%
 

Most frequent Geometric Shapes characters

ValueCountFrequency (%) 
923.7%
 
513.2%
 
513.2%
 
410.5%
 
410.5%
 
25.3%
 
25.3%
 
25.3%
 
25.3%
 
12.6%
 
12.6%
 
12.6%
 

Most frequent Box Drawing characters

ValueCountFrequency (%) 
323.1%
 
323.1%
 
215.4%
 
215.4%
 
17.7%
 
17.7%
 
17.7%
 

Most frequent Tibetan characters

ValueCountFrequency (%) 
112.5%
 
112.5%
 
112.5%
 
112.5%
 
112.5%
 
112.5%
 
112.5%
 
112.5%
 

Most frequent Bamum Sup characters

ValueCountFrequency (%) 
𖤐2100.0%
 

Most frequent Tifinagh characters

ValueCountFrequency (%) 
624.0%
 
416.0%
 
416.0%
 
312.0%
 
28.0%
 
28.0%
 
28.0%
 
28.0%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
1550.0%
 
413.3%
 
26.7%
 
26.7%
 
26.7%
 
13.3%
 
13.3%
 
13.3%
 
13.3%
 
13.3%
 

Most frequent Tagbanwa characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
9815.8%
 
589.4%
 
ி569.0%
 
426.8%
 
396.3%
 
365.8%
 
365.8%
 
325.2%
 
284.5%
 
233.7%
 
213.4%
 
193.1%
 
193.1%
 
142.3%
 
121.9%
 
111.8%
 
101.6%
 
81.3%
 
71.1%
 
71.1%
 
61.0%
 
50.8%
 
50.8%
 
40.6%
 
30.5%
 
Other values (10)203.2%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
58.2%
 
58.2%
 
58.2%
 
34.9%
 
34.9%
 
34.9%
 
34.9%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
23.3%
 
11.6%
 
11.6%
 
11.6%
 
િ11.6%
 
11.6%
 
11.6%
 
Other values (4)46.6%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
2112.0%
 
ି1810.3%
 
126.9%
 
116.3%
 
95.1%
 
95.1%
 
84.6%
 
84.6%
 
74.0%
 
74.0%
 
63.4%
 
63.4%
 
52.9%
 
52.9%
 
52.9%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
10.6%
 
Other values (9)95.1%
 

Most frequent Sharada characters

ValueCountFrequency (%) 
𑆳222.2%
 
𑆮111.1%
 
𑆴111.1%
 
𑆑111.1%
 
𑆱111.1%
 
𑆫111.1%
 
𑆽111.1%
 
𑆤111.1%
 

Most frequent Kayah Li characters

ValueCountFrequency (%) 
333.3%
 
222.2%
 
111.1%
 
111.1%
 
111.1%
 
111.1%
 

Most frequent Geometric Shapes Ext characters

ValueCountFrequency (%) 
🟣375.0%
 
🟥125.0%
 

Most frequent Javanese characters

ValueCountFrequency (%) 
266.7%
 
133.3%
 

Most frequent Balinese characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
342.9%
 
228.6%
 
114.3%
 
114.3%
 

Most frequent Misc Math Symbols A characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Khmer characters

ValueCountFrequency (%) 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 
311.1%
 

Most frequent Cuneiform characters

ValueCountFrequency (%) 
𒇷133.3%
 
𒁯133.3%
 
𒅗133.3%
 

Most frequent Latin Ext D characters

ValueCountFrequency (%) 
480.0%
 
120.0%
 

Most frequent Tai Tham characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Old South Arabian characters

ValueCountFrequency (%) 
𐩱325.0%
 
𐩬216.7%
 
𐩡216.7%
 
𐩴18.3%
 
𐩤18.3%
 
𐩢18.3%
 
𐩷18.3%
 
𐩺18.3%
 

Most frequent Compat Jamo characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Bopomofo characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Jamo characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent CJK Compat Forms characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Runic characters

ValueCountFrequency (%) 
1028.6%
 
1028.6%
 
514.3%
 
514.3%
 
514.3%
 

Most frequent Thaana characters

ValueCountFrequency (%) 
ވ116.7%
 
ަ116.7%
 
އ116.7%
 
ް116.7%
 
ޑ116.7%
 
ެ116.7%
 

Most frequent Mahjong characters

ValueCountFrequency (%) 
🀄2100.0%
 

Most frequent Limbu characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Tai Viet characters

ValueCountFrequency (%) 
360.0%
 
120.0%
 
120.0%
 

Most frequent New Tai Lue characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Tai Le characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Indic Number Forms characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Misc Math Symbols B characters

ValueCountFrequency (%) 
3100.0%
 

Most frequent Tagalog characters

ValueCountFrequency (%) 
220.0%
 
220.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 
110.0%
 

Most frequent Yi Radicals characters

ValueCountFrequency (%) 
1100.0%
 

user_location
Categorical

HIGH CARDINALITY
MISSING

Distinct9345
Distinct (%)26.2%
Missing10365
Missing (%)22.5%
Memory size360.0 KiB
India
 
1237
Toronto, Canada and Worldwide
 
1026
New Delhi, India
 
529
United States
 
468
London, England
 
411
Other values (9340)
32023 
ValueCountFrequency (%) 
India12372.7%
 
Toronto, Canada and Worldwide10262.2%
 
New Delhi, India5291.1%
 
United States4681.0%
 
London, England4110.9%
 
Beijing, China3990.9%
 
Mumbai, India3360.7%
 
London3070.7%
 
Beijing3030.7%
 
New Delhi2360.5%
 
United Kingdom2120.5%
 
New York, NY1880.4%
 
Moscow, Russia1830.4%
 
USA1800.4%
 
Canada1760.4%
 
Los Angeles, CA1720.4%
 
Moscow, Russia 1700.4%
 
Malaysia1640.4%
 
Washington, DC1610.3%
 
UK1500.3%
 
Mumbai1490.3%
 
Hong Kong1470.3%
 
Nairobi, Kenya1460.3%
 
California, USA1440.3%
 
Pakistan1440.3%
 
Other values (9320)2795660.7%
 
(Missing)1036522.5%
 
2021-05-15T20:26:50.072168image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique6277 ?
Unique (%)17.6%
2021-05-15T20:26:50.413305image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length129
Median length11
Mean length11.78835841
Min length1

Overview of Unicode Properties

Unique unicode characters920
Unique unicode categories23 ?
Unique unicode scripts29 ?
Unique unicode blocks56 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a5874710.8%
 
n5765110.6%
 
455058.4%
 
i323116.0%
 
e312225.8%
 
o270205.0%
 
r227244.2%
 
d201223.7%
 
,194343.6%
 
l187033.4%
 
t186353.4%
 
s151992.8%
 
h112132.1%
 
u97991.8%
 
g87881.6%
 
C77551.4%
 
A75351.4%
 
S65781.2%
 
I62571.2%
 
m60891.1%
 
w57261.1%
 
N55231.0%
 
c54381.0%
 
b53601.0%
 
y46790.9%
 
Other values (895)8494715.6%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter37558769.2%
 
Uppercase Letter8533915.7%
 
Space Separator455098.4%
 
Other Punctuation233104.3%
 
Decimal Number42410.8%
 
Other Letter41320.8%
 
Other Symbol19460.4%
 
Dash Punctuation8580.2%
 
Nonspacing Mark5800.1%
 
Spacing Mark5270.1%
 
Math Symbol4110.1%
 
Close Punctuation176< 0.1%
 
Open Punctuation171< 0.1%
 
Format45< 0.1%
 
Modifier Symbol34< 0.1%
 
Final Punctuation32< 0.1%
 
Connector Punctuation22< 0.1%
 
Control14< 0.1%
 
Initial Punctuation10< 0.1%
 
Modifier Letter8< 0.1%
 
Currency Symbol3< 0.1%
 
Enclosing Mark3< 0.1%
 
Other Number2< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C77559.1%
 
A75358.8%
 
S65787.7%
 
I62577.3%
 
N55236.5%
 
U46135.4%
 
B45415.3%
 
M43625.1%
 
L40774.8%
 
T40434.7%
 
E35034.1%
 
D34664.1%
 
W31633.7%
 
P30563.6%
 
K29503.5%
 
R19702.3%
 
H19392.3%
 
O18042.1%
 
Y16381.9%
 
G16381.9%
 
F14821.7%
 
V11281.3%
 
J7530.9%
 
X4930.6%
 
Z4170.5%
 
Other values (68)6550.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a5874715.6%
 
n5765115.3%
 
i323118.6%
 
e312228.3%
 
o270207.2%
 
r227246.1%
 
d201225.4%
 
l187035.0%
 
t186355.0%
 
s151994.0%
 
h112133.0%
 
u97992.6%
 
g87882.3%
 
m60891.6%
 
w57261.5%
 
c54381.4%
 
b53601.4%
 
y46791.2%
 
k40171.1%
 
p33060.9%
 
f25060.7%
 
v21920.6%
 
j12920.3%
 
x7970.2%
 
z6120.2%
 
Other values (153)14390.4%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
45505> 99.9%
 
 3< 0.1%
 
 1< 0.1%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-85199.2%
 
40.5%
 
30.3%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
,1943483.4%
 
.17527.5%
 
/7583.3%
 
&2971.3%
 
#2811.2%
 
:2251.0%
 
'1950.8%
 
!1260.5%
 
@1050.5%
 
320.1%
 
?250.1%
 
"180.1%
 
;150.1%
 
*150.1%
 
،6< 0.1%
 
%5< 0.1%
 
4< 0.1%
 
3< 0.1%
 
\2< 0.1%
 
2< 0.1%
 
2< 0.1%
 
1< 0.1%
 
·1< 0.1%
 
¿1< 0.1%
 
1< 0.1%
 
Other values (4)4< 0.1%
 

Most frequent Other Letter characters

ValueCountFrequency (%) 
2756.7%
 
ا2636.4%
 
1944.7%
 
1593.8%
 
ل1263.0%
 
1002.4%
 
ر982.4%
 
942.3%
 
902.2%
 
892.2%
 
872.1%
 
872.1%
 
872.1%
 
872.1%
 
872.1%
 
852.1%
 
م832.0%
 
ة731.8%
 
ت721.7%
 
641.5%
 
611.5%
 
591.4%
 
551.3%
 
531.3%
 
د521.3%
 
Other values (246)155237.6%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
🇺1256.4%
 
🇳1055.4%
 
🇸964.9%
 
🇮844.3%
 
🇬824.2%
 
🇪743.8%
 
🇧733.8%
 
🇦683.5%
 
🌍593.0%
 
🇨552.8%
 
🇵532.7%
 
🌎422.2%
 
°412.1%
 
🇰382.0%
 
371.9%
 
🌏291.5%
 
291.5%
 
🇱281.4%
 
💶261.3%
 
💵261.3%
 
💷261.3%
 
251.3%
 
🏆241.2%
 
231.2%
 
🇷221.1%
 
Other values (196)65633.7%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
|24258.9%
 
+7518.2%
 
=4711.4%
 
184.4%
 
~163.9%
 
30.7%
 
30.7%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 

Most frequent Nonspacing Mark characters

ValueCountFrequency (%) 
15526.7%
 
12922.2%
 
6911.9%
 
467.9%
 
406.9%
 
284.8%
 
284.8%
 
142.4%
 
111.9%
 
91.6%
 
61.0%
 
61.0%
 
50.9%
 
ಿ40.7%
 
40.7%
 
40.7%
 
ُ30.5%
 
20.3%
 
20.3%
 
20.3%
 
20.3%
 
ି10.2%
 
ٍ10.2%
 
َ10.2%
 
10.2%
 
Other values (7)71.2%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
065115.4%
 
159013.9%
 
249211.6%
 
745310.7%
 
343510.3%
 
93608.5%
 
53458.1%
 
43428.1%
 
82906.8%
 
62826.6%
 
1< 0.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(16495.9%
 
[52.9%
 
10.6%
 
10.6%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)16996.0%
 
]52.8%
 
10.6%
 
10.6%
 

Most frequent Modifier Symbol characters

ValueCountFrequency (%) 
🏾1235.3%
 
🏻1235.3%
 
🏽38.8%
 
🏼25.9%
 
¯25.9%
 
🏿25.9%
 
^12.9%
 

Most frequent Format characters

ValueCountFrequency (%) 
2044.4%
 
󠁧511.1%
 
󠁢48.9%
 
󠁿48.9%
 
󠁳36.7%
 
󠁣24.4%
 
󠁴24.4%
 
󠁷12.2%
 
󠁬12.2%
 
12.2%
 
󠁥12.2%
 
󠁮12.2%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
2371.9%
 
618.8%
 
»39.4%
 

Most frequent Spacing Mark characters

ValueCountFrequency (%) 
31760.2%
 
ि7213.7%
 
6712.7%
 
214.0%
 
81.5%
 
71.3%
 
71.3%
 
ி50.9%
 
30.6%
 
30.6%
 
20.4%
 
20.4%
 
20.4%
 
20.4%
 
20.4%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 

Most frequent Control characters

ValueCountFrequency (%) 
14100.0%
 

Most frequent Modifier Letter characters

ValueCountFrequency (%) 
450.0%
 
ـ225.0%
 
ʻ225.0%
 

Most frequent Initial Punctuation characters

ValueCountFrequency (%) 
660.0%
 
«330.0%
 
110.0%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_22100.0%
 

Most frequent Other Number characters

ValueCountFrequency (%) 
½150.0%
 
150.0%
 

Most frequent Currency Symbol characters

ValueCountFrequency (%) 
$266.7%
 
133.3%
 

Most frequent Enclosing Mark characters

ValueCountFrequency (%) 
3100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin46007684.7%
 
Common7687714.2%
 
Devanagari22380.4%
 
Arabic11900.2%
 
Han9900.2%
 
Cyrillic6070.1%
 
Thai252< 0.1%
 
Inherited198< 0.1%
 
Greek94< 0.1%
 
Kannada90< 0.1%
 
Katakana86< 0.1%
 
Tamil68< 0.1%
 
Hangul42< 0.1%
 
Telugu29< 0.1%
 
Braille23< 0.1%
 
Oriya16< 0.1%
 
Hebrew15< 0.1%
 
Canadian_Aboriginal13< 0.1%
 
Gujarati12< 0.1%
 
Cherokee11< 0.1%
 
Bengali11< 0.1%
 
Coptic6< 0.1%
 
Gurmukhi6< 0.1%
 
Hiragana4< 0.1%
 
Yi2< 0.1%
 
Other values (4)4< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a5874712.8%
 
n5765112.5%
 
i323117.0%
 
e312226.8%
 
o270205.9%
 
r227244.9%
 
d201224.4%
 
l187034.1%
 
t186354.1%
 
s151993.3%
 
h112132.4%
 
u97992.1%
 
g87881.9%
 
C77551.7%
 
A75351.6%
 
S65781.4%
 
I62571.4%
 
m60891.3%
 
w57261.2%
 
N55231.2%
 
c54381.2%
 
b53601.2%
 
y46791.0%
 
U46131.0%
 
B45411.0%
 
Other values (86)5784812.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
4550559.2%
 
,1943425.3%
 
.17522.3%
 
-8511.1%
 
/7581.0%
 
06510.8%
 
15900.8%
 
24920.6%
 
74530.6%
 
34350.6%
 
93600.5%
 
53450.4%
 
43420.4%
 
&2970.4%
 
82900.4%
 
62820.4%
 
#2810.4%
 
|2420.3%
 
:2250.3%
 
'1950.3%
 
)1690.2%
 
(1640.2%
 
!1260.2%
 
🇺1250.2%
 
@1050.1%
 
Other values (369)24083.1%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا26322.1%
 
ل12610.6%
 
ر988.2%
 
م837.0%
 
ة736.1%
 
ت726.1%
 
د524.4%
 
ب504.2%
 
ي484.0%
 
ن423.5%
 
س352.9%
 
ی352.9%
 
ع332.8%
 
و252.1%
 
ح242.0%
 
ک171.4%
 
ہ161.3%
 
پ131.1%
 
ز121.0%
 
إ90.8%
 
ق90.8%
 
ج80.7%
 
ش80.7%
 
ض60.5%
 
أ60.5%
 
Other values (10)272.3%
 

Most frequent Inherited characters

ValueCountFrequency (%) 
15578.3%
 
2010.1%
 
147.1%
 
ُ31.5%
 
31.5%
 
ٍ10.5%
 
َ10.5%
 
ً10.5%
 

Most frequent Han characters

ValueCountFrequency (%) 
949.5%
 
909.1%
 
899.0%
 
878.8%
 
878.8%
 
878.8%
 
878.8%
 
878.8%
 
858.6%
 
171.7%
 
151.5%
 
121.2%
 
121.2%
 
111.1%
 
101.0%
 
101.0%
 
80.8%
 
60.6%
 
60.6%
 
60.6%
 
40.4%
 
40.4%
 
40.4%
 
40.4%
 
40.4%
 
Other values (31)646.5%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
с6410.5%
 
и538.7%
 
а498.1%
 
о477.7%
 
е325.3%
 
я315.1%
 
р294.8%
 
н274.4%
 
к244.0%
 
Р233.8%
 
Л172.8%
 
С172.8%
 
в162.6%
 
т142.3%
 
б142.3%
 
М132.1%
 
А132.1%
 
г101.6%
 
К91.5%
 
л91.5%
 
Е91.5%
 
й91.5%
 
у81.3%
 
м71.2%
 
П71.2%
 
Other values (24)569.2%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
Other values (11)1126.2%
 

Most frequent Greek characters

ValueCountFrequency (%) 
Λ1617.0%
 
Ε1111.7%
 
Α99.6%
 
α99.6%
 
Σ88.5%
 
λ77.4%
 
ά44.3%
 
ν44.3%
 
ς33.2%
 
τ33.2%
 
ο22.1%
 
ι22.1%
 
22.1%
 
υ22.1%
 
Κ11.1%
 
ζ11.1%
 
η11.1%
 
κ11.1%
 
ή11.1%
 
β11.1%
 
σ11.1%
 
ε11.1%
 
ί11.1%
 
11.1%
 
ρ11.1%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
31714.2%
 
27512.3%
 
1948.7%
 
1597.1%
 
1295.8%
 
1004.5%
 
ि723.2%
 
693.1%
 
673.0%
 
642.9%
 
612.7%
 
592.6%
 
552.5%
 
532.4%
 
522.3%
 
462.1%
 
431.9%
 
411.8%
 
401.8%
 
401.8%
 
341.5%
 
271.2%
 
221.0%
 
210.9%
 
210.9%
 
Other values (26)1777.9%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
436.4%
 
436.4%
 
218.2%
 
19.1%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
Ծ1100.0%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
233.3%
 
233.3%
 
Ϯ116.7%
 
116.7%
 

Most frequent Canadian_Aboriginal characters

ValueCountFrequency (%) 
323.1%
 
215.4%
 
215.4%
 
215.4%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 

Most frequent Yi characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
1213.3%
 
1213.3%
 
1011.1%
 
88.9%
 
77.8%
 
66.7%
 
66.7%
 
66.7%
 
ಿ44.4%
 
33.3%
 
33.3%
 
22.2%
 
22.2%
 
22.2%
 
22.2%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
י320.0%
 
ש320.0%
 
ר320.0%
 
א320.0%
 
ל320.0%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
1820.9%
 
1214.0%
 
1112.8%
 
910.5%
 
910.5%
 
910.5%
 
44.7%
 
22.3%
 
22.3%
 
22.3%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
913.2%
 
811.8%
 
710.3%
 
710.3%
 
ி57.4%
 
45.9%
 
45.9%
 
34.4%
 
34.4%
 
34.4%
 
22.9%
 
22.9%
 
22.9%
 
22.9%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
318.8%
 
318.8%
 
212.5%
 
212.5%
 
212.5%
 
16.2%
 
16.2%
 
ି16.2%
 
16.2%
 

Most frequent Thai characters

ValueCountFrequency (%) 
2811.1%
 
2811.1%
 
2811.1%
 
2811.1%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
413.8%
 
310.3%
 
310.3%
 
26.9%
 
26.9%
 
26.9%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
ి13.4%
 
13.4%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
216.7%
 
216.7%
 
216.7%
 
216.7%
 
216.7%
 
216.7%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
218.2%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
233.3%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Braille characters

ValueCountFrequency (%) 
23100.0%
 

Most frequent Syriac characters

ValueCountFrequency (%) 
݁1100.0%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Tangut characters

ValueCountFrequency (%) 
𗀠1100.0%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
125.0%
 
125.0%
 
125.0%
 
125.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII53392198.3%
 
Devanagari22390.4%
 
Arabic12040.2%
 
Enclosed Alphanum Sup10200.2%
 
CJK9900.2%
 
None7440.1%
 
Latin 1 Sup7330.1%
 
Cyrillic6040.1%
 
Thai252< 0.1%
 
VS169< 0.1%
 
Dingbats150< 0.1%
 
Math Alphanum120< 0.1%
 
Punctuation105< 0.1%
 
Kannada90< 0.1%
 
Katakana90< 0.1%
 
Misc Symbols85< 0.1%
 
Tamil68< 0.1%
 
Latin Ext A49< 0.1%
 
Hangul42< 0.1%
 
Telugu29< 0.1%
 
Tags24< 0.1%
 
Braille23< 0.1%
 
Emoticons21< 0.1%
 
IPA Ext18< 0.1%
 
Oriya16< 0.1%
 
Other values (31)154< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a5874711.0%
 
n5765110.8%
 
455058.5%
 
i323116.1%
 
e312225.8%
 
o270205.1%
 
r227244.3%
 
d201223.8%
 
,194343.6%
 
l187033.5%
 
t186353.5%
 
s151992.8%
 
h112132.1%
 
u97991.8%
 
g87881.6%
 
C77551.5%
 
A75351.4%
 
S65781.2%
 
I62571.2%
 
m60891.1%
 
w57261.1%
 
N55231.0%
 
c54381.0%
 
b53601.0%
 
y46790.9%
 
Other values (66)7590814.2%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
ü18825.6%
 
ã10113.8%
 
Ü8010.9%
 
é8010.9%
 
á648.7%
 
°415.6%
 
ó304.1%
 
ë253.4%
 
ñ223.0%
 
Ö111.5%
 
ú101.4%
 
ö91.2%
 
í81.1%
 
ä71.0%
 
ï71.0%
 
è60.8%
 
à50.7%
 
â40.5%
 
ø40.5%
 
É30.4%
 
 30.4%
 
»30.4%
 
«30.4%
 
Ú30.4%
 
¯20.3%
 
Other values (11)141.9%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا26321.8%
 
ل12610.5%
 
ر988.1%
 
م836.9%
 
ة736.1%
 
ت726.0%
 
د524.3%
 
ب504.2%
 
ي484.0%
 
ن423.5%
 
س352.9%
 
ی352.9%
 
ع332.7%
 
و252.1%
 
ح242.0%
 
ک171.4%
 
ہ161.3%
 
پ131.1%
 
ز121.0%
 
إ90.7%
 
ق90.7%
 
ج80.7%
 
ش80.7%
 
ض60.5%
 
أ60.5%
 
Other values (16)413.4%
 

Most frequent Enclosed Alphanum Sup characters

ValueCountFrequency (%) 
🇺12512.3%
 
🇳10510.3%
 
🇸969.4%
 
🇮848.2%
 
🇬828.0%
 
🇪747.3%
 
🇧737.2%
 
🇦686.7%
 
🇨555.4%
 
🇵535.2%
 
🇰383.7%
 
🇱282.7%
 
🇷222.2%
 
🇹202.0%
 
🇲191.9%
 
🇯191.9%
 
🇩141.4%
 
🇭101.0%
 
🇴101.0%
 
🇫90.9%
 
🇽40.4%
 
🇼40.4%
 
🇾30.3%
 
🇿30.3%
 
🇶10.1%
 

Most frequent Dingbats characters

ValueCountFrequency (%) 
3724.7%
 
2919.3%
 
2516.7%
 
85.3%
 
74.7%
 
64.0%
 
64.0%
 
64.0%
 
53.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
10.7%
 
10.7%
 
10.7%
 

Most frequent VS characters

ValueCountFrequency (%) 
15591.7%
 
148.3%
 

Most frequent None characters

ValueCountFrequency (%) 
🌍597.9%
 
🌎425.6%
 
🌏293.9%
 
💶263.5%
 
💵263.5%
 
💷263.5%
 
🏆243.2%
 
182.4%
 
🌐172.3%
 
💙162.2%
 
Λ162.2%
 
📍152.0%
 
🌊131.7%
 
🏾121.6%
 
🏻121.6%
 
Ε111.5%
 
💜111.5%
 
👱101.3%
 
🦞91.2%
 
🦁91.2%
 
Α91.2%
 
α91.2%
 
👩81.1%
 
💻81.1%
 
🏡81.1%
 
Other values (142)30140.5%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
3230.5%
 
2321.9%
 
2019.0%
 
65.7%
 
65.7%
 
43.8%
 
43.8%
 
32.9%
 
32.9%
 
21.9%
 
11.0%
 
11.0%
 

Most frequent Misc Symbols characters

ValueCountFrequency (%) 
1011.8%
 
1011.8%
 
78.2%
 
67.1%
 
67.1%
 
55.9%
 
55.9%
 
44.7%
 
44.7%
 
33.5%
 
33.5%
 
33.5%
 
33.5%
 
22.4%
 
22.4%
 
22.4%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 

Most frequent Latin Ext A characters

ValueCountFrequency (%) 
İ1326.5%
 
č1326.5%
 
Č816.3%
 
ā510.2%
 
ş36.1%
 
ł12.0%
 
ğ12.0%
 
œ12.0%
 
Ā12.0%
 
Ŧ12.0%
 
ī12.0%
 
Ş12.0%
 

Most frequent CJK characters

ValueCountFrequency (%) 
949.5%
 
909.1%
 
899.0%
 
878.8%
 
878.8%
 
878.8%
 
878.8%
 
878.8%
 
858.6%
 
171.7%
 
151.5%
 
121.2%
 
121.2%
 
111.1%
 
101.0%
 
101.0%
 
80.8%
 
60.6%
 
60.6%
 
60.6%
 
40.4%
 
40.4%
 
40.4%
 
40.4%
 
40.4%
 
Other values (31)646.5%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
с6410.6%
 
и538.8%
 
а498.1%
 
о477.8%
 
е325.3%
 
я315.1%
 
р294.8%
 
н274.5%
 
к244.0%
 
Р233.8%
 
Л172.8%
 
С172.8%
 
в162.6%
 
т142.3%
 
б142.3%
 
М132.2%
 
А132.2%
 
г101.7%
 
К91.5%
 
л91.5%
 
Е91.5%
 
й91.5%
 
у81.3%
 
м71.2%
 
П71.2%
 
Other values (21)538.8%
 

Most frequent Math Alphanum characters

ValueCountFrequency (%) 
𝕖65.0%
 
𝗧43.3%
 
𝕒43.3%
 
𝖊32.5%
 
𝖎32.5%
 
𝖓32.5%
 
𝖔32.5%
 
𝑒32.5%
 
𝗮32.5%
 
𝓮21.7%
 
𝒆21.7%
 
𝒅21.7%
 
𝒓21.7%
 
𝖆21.7%
 
𝖙21.7%
 
𝖗21.7%
 
𝖉21.7%
 
𝗘21.7%
 
𝗦21.7%
 
𝚃21.7%
 
𝚎21.7%
 
𝚡21.7%
 
𝚊21.7%
 
𝚜21.7%
 
𝓇21.7%
 
Other values (46)5646.7%
 

Most frequent Emoticons characters

ValueCountFrequency (%) 
😉419.0%
 
😂419.0%
 
😎314.3%
 
😊29.5%
 
🙄29.5%
 
🙏14.8%
 
🙌14.8%
 
😌14.8%
 
😝14.8%
 
😢14.8%
 
😷14.8%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
24.8%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
Other values (11)1126.2%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
31714.2%
 
27512.3%
 
1948.7%
 
1597.1%
 
1295.8%
 
1004.5%
 
ि723.2%
 
693.1%
 
673.0%
 
642.9%
 
612.7%
 
592.6%
 
552.5%
 
532.4%
 
522.3%
 
462.1%
 
431.9%
 
411.8%
 
401.8%
 
401.8%
 
341.5%
 
271.2%
 
221.0%
 
210.9%
 
210.9%
 
Other values (27)1787.9%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
436.4%
 
436.4%
 
218.2%
 
19.1%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
Ծ1100.0%
 

Most frequent UCAS characters

ValueCountFrequency (%) 
323.1%
 
215.4%
 
215.4%
 
215.4%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 

Most frequent Yi Syllables characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Yi Radicals characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
1213.3%
 
1213.3%
 
1011.1%
 
88.9%
 
77.8%
 
66.7%
 
66.7%
 
66.7%
 
ಿ44.4%
 
33.3%
 
33.3%
 
22.2%
 
22.2%
 
22.2%
 
22.2%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
י320.0%
 
ש320.0%
 
ר320.0%
 
א320.0%
 
ל320.0%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
1820.0%
 
1213.3%
 
1112.2%
 
910.0%
 
910.0%
 
910.0%
 
44.4%
 
44.4%
 
22.2%
 
22.2%
 
22.2%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
913.2%
 
811.8%
 
710.3%
 
710.3%
 
ி57.4%
 
45.9%
 
45.9%
 
34.4%
 
34.4%
 
34.4%
 
22.9%
 
22.9%
 
22.9%
 
22.9%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 

Most frequent Arrows characters

ValueCountFrequency (%) 
325.0%
 
325.0%
 
216.7%
 
18.3%
 
18.3%
 
18.3%
 
18.3%
 

Most frequent Misc Technical characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Math Operators characters

ValueCountFrequency (%) 
350.0%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
318.8%
 
318.8%
 
212.5%
 
212.5%
 
212.5%
 
16.2%
 
16.2%
 
ି16.2%
 
16.2%
 

Most frequent Phonetic Ext characters

ValueCountFrequency (%) 
430.8%
 
323.1%
 
215.4%
 
215.4%
 
17.7%
 
17.7%
 

Most frequent Latin Ext B characters

ValueCountFrequency (%) 
ƚ266.7%
 
ǝ133.3%
 

Most frequent IPA Ext characters

ValueCountFrequency (%) 
ʀ422.2%
 
ɘ316.7%
 
ɒ316.7%
 
ɪ211.1%
 
ɿ211.1%
 
ɥ15.6%
 
ʇ15.6%
 
ɔ15.6%
 
ʞ15.6%
 

Most frequent Cyrillic Sup characters

ValueCountFrequency (%) 
Ԁ1100.0%
 

Most frequent Thai characters

ValueCountFrequency (%) 
2811.1%
 
2811.1%
 
2811.1%
 
2811.1%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 
145.6%
 

Most frequent Box Drawing characters

ValueCountFrequency (%) 
250.0%
 
250.0%
 

Most frequent Latin Ext D characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Tags characters

ValueCountFrequency (%) 
󠁧520.8%
 
󠁢416.7%
 
󠁿416.7%
 
󠁳312.5%
 
󠁣28.3%
 
󠁴28.3%
 
󠁷14.2%
 
󠁬14.2%
 
󠁥14.2%
 
󠁮14.2%
 

Most frequent Letterlike Symbols characters

ValueCountFrequency (%) 
327.3%
 
327.3%
 
218.2%
 
218.2%
 
19.1%
 

Most frequent Currency Symbols characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
413.8%
 
310.3%
 
310.3%
 
26.9%
 
26.9%
 
26.9%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
13.4%
 
ి13.4%
 
13.4%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
240.0%
 
240.0%
 
120.0%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
216.7%
 
216.7%
 
216.7%
 
216.7%
 
216.7%
 
216.7%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
218.2%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 
19.1%
 

Most frequent Greek Ext characters

ValueCountFrequency (%) 
266.7%
 
133.3%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
233.3%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Braille characters

ValueCountFrequency (%) 
23100.0%
 

Most frequent Syriac characters

ValueCountFrequency (%) 
݁1100.0%
 

Most frequent Small Forms characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Playing Cards characters

ValueCountFrequency (%) 
🃏1100.0%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Latin Ext Additional characters

ValueCountFrequency (%) 
240.0%
 
240.0%
 
120.0%
 

Most frequent Modifier Letters characters

ValueCountFrequency (%) 
ʻ2100.0%
 

Most frequent Cyrillic Ext B characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Geometric Shapes characters

ValueCountFrequency (%) 
3100.0%
 

Most frequent Tangut characters

ValueCountFrequency (%) 
𗀠1100.0%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
125.0%
 
125.0%
 
125.0%
 
125.0%
 

user_description
Categorical

HIGH CARDINALITY
MISSING

Distinct24494
Distinct (%)57.0%
Missing3090
Missing (%)6.7%
Memory size360.0 KiB
George Tsanis – Workout Solutions Health and Fitness Consultants since 1996 – One-on-one and online distance coaching – Toronto, Canada, World
 
1026
Sputnik is a global wire, radio and digital news service. We exist to tell the stories that aren’t being told.
 
283
Latest business news and valuable information from China.
 
184
Official Twitter account of Ilke News Agency /
 
135
| political | cats | bikes | civil rights | tech | photography
 
126
Other values (24489)
41215 
ValueCountFrequency (%) 
George Tsanis – Workout Solutions Health and Fitness Consultants since 1996 – One-on-one and online distance coaching – Toronto, Canada, World10262.2%
 
Sputnik is a global wire, radio and digital news service. We exist to tell the stories that aren’t being told.2830.6%
 
Latest business news and valuable information from China.1840.4%
 
Official Twitter account of Ilke News Agency /1350.3%
 
| political | cats | bikes | civil rights | tech | photography1260.3%
 
We are a group of traders who are here to impart financial education1210.3%
 
The largest newspaper in China1100.2%
 
Mask-loving, Trump-hating liberal opposed to genetic vaccines🌲🌲 Warp Speed denied U.S. access to traditional vaccines. Until a vaccine there's Ivermectin950.2%
 
News, views and up-to-date reports from Malaysia's premier news source. All that and more at https://t.co/S8jbx5pMaF910.2%
 
Brazil SFE®| We are passionate about improving our world with #Artificialintelligence, #Automation, #Analytics #innovation #digital #ai #vr #ar #ml #rpa900.2%
 
The official twitter account of the Embassy of the People's Republic of China in the Republic of the Philippines880.2%
 
CGTN is an international media organization. It aims to provide global audiences with accurate and timely news coverage as well as rich audiovisual services.790.2%
 
The official account of The Peninsula English Daily Newspaper #Qatar #Doha730.2%
 
I just share my Passion for the Stock Market & my own Conviction. Positive Mindeset🌞 $OCGN $SYA $CBDT $QYOU No financial Advice or Buy Recommendation📈710.2%
 
✍️ INFORMED CONSENT in a socially responsible and just manner. Society has responsibility to ensure means of compensating those with vaccine-related injuries.680.1%
 
CCTV+ is a leading video news agency in China that offers Chinese news and Chinese perspective on international news.680.1%
 
Reporting Africa, BRI; Africa fellow in Univ; Years in Africa; Charhar Inst. N.S Korea study;Analyst on China overseas political & economic stakes.Personal view630.1%
 
Investor with positive & fresh Mindeset🌴☀️ $OCGN $CBDT $CWGYF $QYOU $SYA $IPNFF No financial Advice or Buy Recommendation📈600.1%
 
Ex @Tibetans supports Tibetan National Resistance against China’s Military Occupation. When Dalai Lama escaped into Exile, CCP’s dictator Mao said:WE LOST TIBET560.1%
 
Freedom over censorship, truth over narrative On air in 100+ countries. 10+ billion video views in 2020 Don’t want your news filtered - find us at https://t.co/9V8JaMqU2C560.1%
 
Human being one of many not for sale550.1%
 
India's largest independent News Agency550.1%
 
love the beauty of the nature and I am beautiful too, student of commerce, love to sing,dance and travel at new places530.1%
 
Professional trading consultant specializing in contracting and due diligence with a strong pres­ence and network in international markets.520.1%
 
Research Consultant: Political-Economy Analysis,Geopolitics. #Russia, Ex-USSR,M.East. https://t.co/bnDgl9eyg4, https://t.co/KRorvOFoW1, PMPMag Rt#End520.1%
 
Other values (24469)3975986.3%
 
(Missing)30906.7%
 
2021-05-15T20:26:51.022229image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique19501 ?
Unique (%)45.4%
2021-05-15T20:26:51.425855image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length248
Median length115
Mean length101.1463775
Min length1

Overview of Unicode Properties

Unique unicode characters3541
Unique unicode categories24 ?
Unique unicode scripts42 ?
Unique unicode blocks79 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
63232513.6%
 
e3665127.9%
 
a2751985.9%
 
i2662135.7%
 
n2640265.7%
 
o2619275.6%
 
t2594165.6%
 
r2235334.8%
 
s2111194.5%
 
l1470313.2%
 
c1115522.4%
 
d1101032.4%
 
h991192.1%
 
u898331.9%
 
m784641.7%
 
g651231.4%
 
p644891.4%
 
.642851.4%
 
f580551.2%
 
y539201.2%
 
,513251.1%
 
w481441.0%
 
v422510.9%
 
b378890.8%
 
C319950.7%
 
Other values (3516)74485416.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter319353068.5%
 
Space Separator63250113.6%
 
Uppercase Letter4183809.0%
 
Other Punctuation2234144.8%
 
Decimal Number392400.8%
 
Other Symbol326660.7%
 
Other Letter297240.6%
 
Control184120.4%
 
Dash Punctuation179270.4%
 
Math Symbol170780.4%
 
Nonspacing Mark105660.2%
 
Spacing Mark55800.1%
 
Close Punctuation35080.1%
 
Open Punctuation33720.1%
 
Final Punctuation32300.1%
 
Format27200.1%
 
Connector Punctuation25320.1%
 
Currency Symbol1762< 0.1%
 
Modifier Symbol1287< 0.1%
 
Initial Punctuation692< 0.1%
 
Modifier Letter485< 0.1%
 
Enclosing Mark52< 0.1%
 
Private Use27< 0.1%
 
Other Number16< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C319957.6%
 
S305567.3%
 
A295907.1%
 
T290807.0%
 
I259736.2%
 
P237445.7%
 
M234735.6%
 
E214765.1%
 
N193554.6%
 
R185254.4%
 
D179834.3%
 
B177754.2%
 
F171554.1%
 
L156933.8%
 
O150423.6%
 
H148413.5%
 
W139983.3%
 
G128783.1%
 
U94732.3%
 
V80351.9%
 
J58991.4%
 
K49771.2%
 
Y47631.1%
 
Q19290.5%
 
X15190.4%
 
Other values (225)26530.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e36651211.5%
 
a2751988.6%
 
i2662138.3%
 
n2640268.3%
 
o2619278.2%
 
t2594168.1%
 
r2235337.0%
 
s2111196.6%
 
l1470314.6%
 
c1115523.5%
 
d1101033.4%
 
h991193.1%
 
u898332.8%
 
m784642.5%
 
g651232.0%
 
p644892.0%
 
f580551.8%
 
y539201.7%
 
w481441.5%
 
v422511.3%
 
b378891.2%
 
k276760.9%
 
x71320.2%
 
j52070.2%
 
z50490.2%
 
Other values (464)145490.5%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
632325> 99.9%
 
 171< 0.1%
 
 4< 0.1%
 
1< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.6428528.8%
 
,5132523.0%
 
#3116814.0%
 
/2293210.3%
 
@130625.8%
 
&100464.5%
 
:97114.3%
 
!62172.8%
 
'60042.7%
 
;25271.1%
 
20840.9%
 
"9610.4%
 
*8610.4%
 
?5750.3%
 
%4190.2%
 
3780.2%
 
1930.1%
 
\109< 0.1%
 
92< 0.1%
 
87< 0.1%
 
،55< 0.1%
 
49< 0.1%
 
45< 0.1%
 
33< 0.1%
 
33< 0.1%
 
Other values (22)1630.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1748119.1%
 
2572514.6%
 
9562414.3%
 
0556514.2%
 
630837.9%
 
428207.2%
 
325666.5%
 
522925.8%
 
822395.7%
 
718004.6%
 
𝟏11< 0.1%
 
𝟗5< 0.1%
 
𝟖5< 0.1%
 
4< 0.1%
 
2< 0.1%
 
𝟸2< 0.1%
 
𝟺2< 0.1%
 
۹2< 0.1%
 
𝟚2< 0.1%
 
2< 0.1%
 
𝟐1< 0.1%
 
𝟔1< 0.1%
 
1< 0.1%
 
𝟜1< 0.1%
 
𝟘1< 0.1%
 
Other values (3)3< 0.1%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
🇺11443.5%
 
🇮9402.9%
 
🇳8652.6%
 
8402.6%
 
🇸8172.5%
 
🇪6632.0%
 
🌈5961.8%
 
🇦5331.6%
 
🇧4891.5%
 
🇨4691.4%
 
🇬4471.4%
 
🇷4281.3%
 
🏳4011.2%
 
🙏3791.2%
 
🚩3711.1%
 
💙3431.1%
 
🇵3251.0%
 
👩3010.9%
 
🇰2910.9%
 
📈2870.9%
 
🌲2790.9%
 
🇹2550.8%
 
2540.8%
 
🇭2510.8%
 
🇲2460.8%
 
Other values (1139)2045262.6%
 

Most frequent Format characters

ValueCountFrequency (%) 
135349.7%
 
󠁧2268.3%
 
2057.5%
 
󠁢1706.2%
 
󠁿1686.2%
 
󠁳1134.2%
 
󠁣712.6%
 
󠁴702.6%
 
652.4%
 
󠁥562.1%
 
󠁮562.1%
 
­521.9%
 
󠁷431.6%
 
󠁬421.5%
 
180.7%
 
60.2%
 
60.2%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-1452481.0%
 
316917.7%
 
1991.1%
 
190.1%
 
100.1%
 
5< 0.1%
 
1< 0.1%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
|1415882.9%
 
+11856.9%
 
~5943.5%
 
=5643.3%
 
4472.6%
 
140.1%
 
130.1%
 
130.1%
 
130.1%
 
110.1%
 
7< 0.1%
 
6< 0.1%
 
×6< 0.1%
 
6< 0.1%
 
5< 0.1%
 
4< 0.1%
 
÷3< 0.1%
 
3< 0.1%
 
3< 0.1%
 
3< 0.1%
 
3< 0.1%
 
2< 0.1%
 
2< 0.1%
 
2< 0.1%
 
2< 0.1%
 
Other values (8)90.1%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_252899.8%
 
40.2%
 

Most frequent Control characters

ValueCountFrequency (%) 
1822499.0%
 
1861.0%
 
2< 0.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(321295.3%
 
[471.4%
 
{371.1%
 
300.9%
 
180.5%
 
120.4%
 
60.2%
 
50.1%
 
40.1%
 
﴿1< 0.1%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)334895.4%
 
]491.4%
 
}401.1%
 
300.9%
 
180.5%
 
120.3%
 
60.2%
 
40.1%
 
1< 0.1%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
275785.4%
 
45714.1%
 
»160.5%
 

Most frequent Modifier Symbol characters

ValueCountFrequency (%) 
🏻47136.6%
 
🏼28822.4%
 
🏾19615.2%
 
🏽19315.0%
 
🏿574.4%
 
^443.4%
 
`161.2%
 
¯90.7%
 
¸50.4%
 
´40.3%
 
¨40.3%
 

Most frequent Nonspacing Mark characters

ValueCountFrequency (%) 
388836.8%
 
188617.8%
 
107810.2%
 
7096.7%
 
͟4814.6%
 
4644.4%
 
2852.7%
 
2502.4%
 
2412.3%
 
960.9%
 
ಿ900.9%
 
َ740.7%
 
710.7%
 
640.6%
 
620.6%
 
520.5%
 
510.5%
 
ِ450.4%
 
ّ390.4%
 
370.4%
 
370.4%
 
ి360.3%
 
ُ330.3%
 
250.2%
 
ْ240.2%
 
Other values (94)4484.2%
 

Most frequent Other Letter characters

ValueCountFrequency (%) 
19026.4%
 
12114.1%
 
11864.0%
 
11733.9%
 
ا9953.3%
 
9473.2%
 
9033.0%
 
8582.9%
 
6982.3%
 
6822.3%
 
ل6592.2%
 
6242.1%
 
4781.6%
 
4361.5%
 
ي4261.4%
 
4261.4%
 
م4021.4%
 
ن3811.3%
 
ر3761.3%
 
و3651.2%
 
3541.2%
 
3411.1%
 
3261.1%
 
3111.0%
 
2690.9%
 
Other values (1189)1299543.7%
 

Most frequent Currency Symbol characters

ValueCountFrequency (%) 
$171897.5%
 
211.2%
 
£120.7%
 
¢50.3%
 
20.1%
 
¤20.1%
 
10.1%
 
¥10.1%
 

Most frequent Spacing Mark characters

ValueCountFrequency (%) 
206136.9%
 
ि110819.9%
 
74113.3%
 
57910.4%
 
ி1502.7%
 
1192.1%
 
791.4%
 
691.2%
 
681.2%
 
651.2%
 
530.9%
 
470.8%
 
ি390.7%
 
310.6%
 
270.5%
 
270.5%
 
240.4%
 
230.4%
 
220.4%
 
200.4%
 
190.3%
 
180.3%
 
170.3%
 
160.3%
 
140.3%
 
Other values (35)1442.6%
 

Most frequent Initial Punctuation characters

ValueCountFrequency (%) 
44964.9%
 
22732.8%
 
«162.3%
 

Most frequent Other Number characters

ValueCountFrequency (%) 
²1062.5%
 
212.5%
 
¾212.5%
 
16.2%
 
16.2%
 

Most frequent Modifier Letter characters

ValueCountFrequency (%) 
8617.7%
 
387.8%
 
347.0%
 
ـ326.6%
 
265.4%
 
265.4%
 
204.1%
 
ʳ193.9%
 
ˢ193.9%
 
183.7%
 
173.5%
 
163.3%
 
ʰ142.9%
 
ˡ122.5%
 
122.5%
 
102.1%
 
91.9%
 
71.4%
 
61.2%
 
51.0%
 
40.8%
 
ʲ40.8%
 
40.8%
 
40.8%
 
40.8%
 
Other values (24)398.0%
 

Most frequent Enclosing Mark characters

ValueCountFrequency (%) 
2853.8%
 
҉2446.2%
 

Most frequent Private Use characters

ValueCountFrequency (%) 
1970.4%
 
311.1%
 
311.1%
 
󾓩27.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin360253377.3%
 
Common100186421.5%
 
Devanagari245160.5%
 
Arabic65770.1%
 
Inherited61130.1%
 
Cyrillic57470.1%
 
Han32350.1%
 
Tamil1696< 0.1%
 
Kannada1177< 0.1%
 
Greek1145< 0.1%
 
Hiragana732< 0.1%
 
Telugu596< 0.1%
 
Bengali546< 0.1%
 
Thai545< 0.1%
 
Katakana458< 0.1%
 
Hebrew198< 0.1%
 
Gurmukhi195< 0.1%
 
Canadian_Aboriginal143< 0.1%
 
Malayalam116< 0.1%
 
Oriya109< 0.1%
 
Sinhala102< 0.1%
 
Gujarati97< 0.1%
 
Hangul87< 0.1%
 
Braille40< 0.1%
 
Armenian34< 0.1%
 
Other values (17)100< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e36651210.2%
 
a2751987.6%
 
i2662137.4%
 
n2640267.3%
 
o2619277.3%
 
t2594167.2%
 
r2235336.2%
 
s2111195.9%
 
l1470314.1%
 
c1115523.1%
 
d1101033.1%
 
h991192.8%
 
u898332.5%
 
m784642.2%
 
g651231.8%
 
p644891.8%
 
f580551.6%
 
y539201.5%
 
w481441.3%
 
v422511.2%
 
b378891.1%
 
C319950.9%
 
S305560.8%
 
A295900.8%
 
T290800.8%
 
Other values (224)3473959.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
63232563.1%
 
.642856.4%
 
,513255.1%
 
#311683.1%
 
/229322.3%
 
182241.8%
 
-145241.4%
 
|141581.4%
 
@130621.3%
 
&100461.0%
 
:97111.0%
 
174810.7%
 
!62170.6%
 
'60040.6%
 
257250.6%
 
956240.6%
 
055650.6%
 
)33480.3%
 
(32120.3%
 
31690.3%
 
630830.3%
 
428200.3%
 
27570.3%
 
325660.3%
 
_25280.3%
 
Other values (1689)600056.0%
 

Most frequent Inherited characters

ValueCountFrequency (%) 
388863.6%
 
135322.1%
 
͟4817.9%
 
َ741.2%
 
510.8%
 
ِ450.7%
 
ّ390.6%
 
ُ330.5%
 
280.5%
 
ْ240.4%
 
̶130.2%
 
̞100.2%
 
ً90.1%
 
̈80.1%
 
60.1%
 
ٰ60.1%
 
͌40.1%
 
̽40.1%
 
ٍ2< 0.1%
 
́2< 0.1%
 
͛2< 0.1%
 
͖2< 0.1%
 
̆2< 0.1%
 
̂2< 0.1%
 
͚2< 0.1%
 
Other values (17)230.4%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا99515.1%
 
ل65910.0%
 
ي4266.5%
 
م4026.1%
 
ن3815.8%
 
ر3765.7%
 
و3655.5%
 
ت2563.9%
 
ب2233.4%
 
ع2173.3%
 
س2143.3%
 
د2143.3%
 
ة1912.9%
 
ف1271.9%
 
ح1261.9%
 
ه1241.9%
 
ق1111.7%
 
ك1081.6%
 
ی1061.6%
 
ش811.2%
 
خ801.2%
 
ج711.1%
 
أ711.1%
 
ط691.0%
 
ز661.0%
 
Other values (30)5187.9%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
20618.4%
 
19027.8%
 
18867.7%
 
12114.9%
 
11864.8%
 
11734.8%
 
ि11084.5%
 
10784.4%
 
9473.9%
 
9033.7%
 
8583.5%
 
7413.0%
 
7092.9%
 
6982.8%
 
6822.8%
 
6242.5%
 
5792.4%
 
4781.9%
 
4641.9%
 
4361.8%
 
4261.7%
 
3541.4%
 
3411.4%
 
3261.3%
 
3111.3%
 
Other values (47)303412.4%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
и5329.3%
 
с4738.2%
 
о4437.7%
 
а4227.3%
 
е3335.8%
 
н3145.5%
 
р2524.4%
 
т2494.3%
 
к2424.2%
 
л2203.8%
 
в2123.7%
 
у1532.7%
 
д1522.6%
 
м1262.2%
 
ь1202.1%
 
я1111.9%
 
п971.7%
 
з961.7%
 
й951.7%
 
П891.5%
 
ы871.5%
 
х821.4%
 
Р721.3%
 
ю691.2%
 
ц611.1%
 
Other values (42)64511.2%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
5311.6%
 
337.2%
 
296.3%
 
286.1%
 
286.1%
 
204.4%
 
183.9%
 
153.3%
 
143.1%
 
143.1%
 
143.1%
 
132.8%
 
132.8%
 
122.6%
 
122.6%
 
112.4%
 
112.4%
 
81.7%
 
71.5%
 
71.5%
 
71.5%
 
61.3%
 
61.3%
 
61.3%
 
51.1%
 
Other values (28)6814.8%
 

Most frequent Han characters

ValueCountFrequency (%) 
601.9%
 
581.8%
 
571.8%
 
561.7%
 
531.6%
 
491.5%
 
441.4%
 
421.3%
 
391.2%
 
341.1%
 
331.0%
 
311.0%
 
300.9%
 
290.9%
 
290.9%
 
290.9%
 
270.8%
 
270.8%
 
260.8%
 
250.8%
 
250.8%
 
250.8%
 
240.7%
 
240.7%
 
240.7%
 
Other values (547)233572.2%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
567.7%
 
466.3%
 
456.1%
 
446.0%
 
405.5%
 
405.5%
 
395.3%
 
354.8%
 
344.6%
 
324.4%
 
294.0%
 
223.0%
 
223.0%
 
202.7%
 
192.6%
 
172.3%
 
162.2%
 
152.0%
 
141.9%
 
141.9%
 
111.5%
 
111.5%
 
111.5%
 
81.1%
 
81.1%
 
Other values (26)8411.5%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
33.4%
 
33.4%
 
33.4%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
Other values (48)4855.2%
 

Most frequent Greek characters

ValueCountFrequency (%) 
ε897.8%
 
π766.6%
 
α746.5%
 
ο706.1%
 
ι696.0%
 
ς685.9%
 
σ665.8%
 
ν534.6%
 
ρ484.2%
 
υ383.3%
 
ω383.3%
 
κ312.7%
 
η292.5%
 
θ292.5%
 
μ242.1%
 
τ221.9%
 
Φ211.8%
 
Δ201.7%
 
λ171.5%
 
Θ161.4%
 
Σ151.3%
 
ό151.3%
 
έ151.3%
 
γ141.2%
 
Ω131.1%
 
Other values (42)17515.3%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
28516.8%
 
1649.7%
 
ி1508.8%
 
1418.3%
 
975.7%
 
925.4%
 
764.5%
 
734.3%
 
694.1%
 
653.8%
 
482.8%
 
452.7%
 
442.6%
 
442.6%
 
362.1%
 
331.9%
 
331.9%
 
291.7%
 
271.6%
 
241.4%
 
171.0%
 
140.8%
 
140.8%
 
120.7%
 
110.6%
 
Other values (14)533.1%
 

Most frequent Unknown characters

ValueCountFrequency (%) 
1970.4%
 
311.1%
 
311.1%
 
󾓩27.4%
 

Most frequent Braille characters

ValueCountFrequency (%) 
40100.0%
 

Most frequent Egyptian_Hieroglyphs characters

ValueCountFrequency (%) 
𓅓133.3%
 
𓆉133.3%
 
𓇽133.3%
 

Most frequent Old_Turkic characters

ValueCountFrequency (%) 
𐰀315.8%
 
𐰤210.5%
 
𐰢210.5%
 
𐰆210.5%
 
𐰇210.5%
 
𐱃15.3%
 
𐰞15.3%
 
𐱅15.3%
 
𐰼15.3%
 
𐰚15.3%
 
𐰓15.3%
 
𐰃15.3%
 
𐰘15.3%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
ି109.2%
 
87.3%
 
76.4%
 
76.4%
 
65.5%
 
65.5%
 
43.7%
 
43.7%
 
43.7%
 
43.7%
 
43.7%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
Other values (11)1110.1%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
7914.5%
 
ি397.1%
 
376.8%
 
346.2%
 
254.6%
 
254.6%
 
224.0%
 
193.5%
 
183.3%
 
183.3%
 
183.3%
 
142.6%
 
142.6%
 
142.6%
 
132.4%
 
122.2%
 
112.0%
 
112.0%
 
101.8%
 
91.6%
 
91.6%
 
81.5%
 
81.5%
 
71.3%
 
71.3%
 
Other values (20)6511.9%
 

Most frequent Cuneiform characters

ValueCountFrequency (%) 
𒀭2100.0%
 

Most frequent Sinhala characters

ValueCountFrequency (%) 
1211.8%
 
87.8%
 
76.9%
 
65.9%
 
65.9%
 
65.9%
 
54.9%
 
54.9%
 
43.9%
 
43.9%
 
43.9%
 
32.9%
 
32.9%
 
32.9%
 
32.9%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
Other values (7)76.9%
 

Most frequent Thai characters

ValueCountFrequency (%) 
397.2%
 
376.8%
 
325.9%
 
285.1%
 
244.4%
 
213.9%
 
203.7%
 
183.3%
 
183.3%
 
183.3%
 
173.1%
 
162.9%
 
142.6%
 
142.6%
 
122.2%
 
122.2%
 
122.2%
 
112.0%
 
112.0%
 
112.0%
 
112.0%
 
112.0%
 
101.8%
 
91.7%
 
91.7%
 
Other values (28)11020.2%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
1079.1%
 
968.2%
 
ಿ907.6%
 
806.8%
 
806.8%
 
685.8%
 
534.5%
 
504.2%
 
484.1%
 
443.7%
 
433.7%
 
413.5%
 
373.1%
 
282.4%
 
272.3%
 
272.3%
 
252.1%
 
232.0%
 
211.8%
 
191.6%
 
191.6%
 
171.4%
 
141.2%
 
141.2%
 
131.1%
 
Other values (18)937.9%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
א2211.1%
 
ו189.1%
 
י168.1%
 
ל168.1%
 
ר157.6%
 
ש136.6%
 
ת94.5%
 
ה84.0%
 
ְ84.0%
 
ָ84.0%
 
ע63.0%
 
מ63.0%
 
ב63.0%
 
ד52.5%
 
ג52.5%
 
ֶ52.5%
 
ך42.0%
 
כ31.5%
 
ם31.5%
 
ּ31.5%
 
ַ31.5%
 
נ21.0%
 
ח21.0%
 
צ21.0%
 
ף21.0%
 
Other values (6)84.0%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
2010.3%
 
168.2%
 
147.2%
 
94.6%
 
94.6%
 
84.1%
 
73.6%
 
73.6%
 
73.6%
 
73.6%
 
73.6%
 
63.1%
 
63.1%
 
63.1%
 
52.6%
 
ਿ52.6%
 
52.6%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
21.0%
 
Other values (17)2110.8%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
770.0%
 
220.0%
 
110.0%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
1212.4%
 
77.2%
 
77.2%
 
66.2%
 
66.2%
 
66.2%
 
55.2%
 
55.2%
 
55.2%
 
44.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
િ11.0%
 
11.0%
 
Other values (5)55.2%
 

Most frequent Malayalam characters

ValueCountFrequency (%) 
1412.1%
 
108.6%
 
108.6%
 
97.8%
 
ി76.0%
 
65.2%
 
54.3%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
32.6%
 
32.6%
 
32.6%
 
32.6%
 
32.6%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
10.9%
 
10.9%
 
10.9%
 
Other values (9)97.8%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
528.7%
 
477.9%
 
376.2%
 
ి366.0%
 
345.7%
 
315.2%
 
315.2%
 
284.7%
 
274.5%
 
274.5%
 
203.4%
 
193.2%
 
152.5%
 
152.5%
 
152.5%
 
152.5%
 
132.2%
 
132.2%
 
122.0%
 
111.8%
 
101.7%
 
101.7%
 
91.5%
 
91.5%
 
81.3%
 
Other values (21)528.7%
 

Most frequent Tagalog characters

ValueCountFrequency (%) 
125.0%
 
125.0%
 
125.0%
 
125.0%
 

Most frequent Canadian_Aboriginal characters

ValueCountFrequency (%) 
2316.1%
 
1711.9%
 
1611.2%
 
1510.5%
 
149.8%
 
107.0%
 
96.3%
 
85.6%
 
64.2%
 
53.5%
 
42.8%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
228.6%
 
228.6%
 
228.6%
 
114.3%
 

Most frequent Tibetan characters

ValueCountFrequency (%) 
4100.0%
 

Most frequent Javanese characters

ValueCountFrequency (%) 
250.0%
 
250.0%
 

Most frequent Ol_Chiki characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
240.0%
 
120.0%
 
120.0%
 
120.0%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
ա823.5%
 
կ411.8%
 
ե38.8%
 
տ25.9%
 
ն25.9%
 
ր25.9%
 
Ք12.9%
 
ո12.9%
 
ղ12.9%
 
վ12.9%
 
զ12.9%
 
ը12.9%
 
֍12.9%
 
֎12.9%
 
Փ12.9%
 
ս12.9%
 
չ12.9%
 
հ12.9%
 
ի12.9%
 

Most frequent Runic characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Tifinagh characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Ethiopic characters

ValueCountFrequency (%) 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Lao characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Yi characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Bamum characters

ValueCountFrequency (%) 
𖥻1100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII455118397.7%
 
Devanagari249270.5%
 
None190410.4%
 
Punctuation113100.2%
 
Enclosed Alphanum Sup92690.2%
 
Arabic68960.1%
 
Cyrillic57470.1%
 
VS39390.1%
 
CJK32350.1%
 
Math Alphanum28070.1%
 
Latin 1 Sup24700.1%
 
Misc Symbols24390.1%
 
Dingbats2248< 0.1%
 
Tamil1696< 0.1%
 
Emoticons1524< 0.1%
 
Kannada1177< 0.1%
 
Tags1015< 0.1%
 
Phonetic Ext771< 0.1%
 
Hiragana732< 0.1%
 
Telugu596< 0.1%
 
Katakana568< 0.1%
 
Diacriticals551< 0.1%
 
Bengali546< 0.1%
 
Thai545< 0.1%
 
Math Operators501< 0.1%
 
Other values (54)29680.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
63232513.9%
 
e3665128.1%
 
a2751986.0%
 
i2662135.8%
 
n2640265.8%
 
o2619275.8%
 
t2594165.7%
 
r2235334.9%
 
s2111194.6%
 
l1470313.2%
 
c1115522.5%
 
d1101032.4%
 
h991192.2%
 
u898332.0%
 
m784641.7%
 
g651231.4%
 
p644891.4%
 
.642851.4%
 
f580551.3%
 
y539201.2%
 
,513251.1%
 
w481441.1%
 
v422510.9%
 
b378890.8%
 
C319950.7%
 
Other values (71)63733614.0%
 

Most frequent None characters

ValueCountFrequency (%) 
🌈5963.1%
 
🏻4712.5%
 
🏳4012.1%
 
🚩3711.9%
 
💙3431.8%
 
👩3011.6%
 
🏼2881.5%
 
📈2871.5%
 
🌲2791.5%
 
🌊2411.3%
 
👨2251.2%
 
🏾1961.0%
 
🏴1951.0%
 
🏽1931.0%
 
💚1831.0%
 
🎓1690.9%
 
🌴1600.8%
 
💜1550.8%
 
📚1530.8%
 
💻1440.8%
 
🚫1430.8%
 
💯1410.7%
 
🎶1400.7%
 
🌎1320.7%
 
🔥1140.6%
 
Other values (878)1302068.4%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
316928.0%
 
275724.4%
 
208418.4%
 
135312.0%
 
4574.0%
 
4494.0%
 
2272.0%
 
2051.8%
 
1991.8%
 
1931.7%
 
650.6%
 
450.4%
 
250.2%
 
190.2%
 
180.2%
 
100.1%
 
60.1%
 
60.1%
 
5< 0.1%
 
5< 0.1%
 
4< 0.1%
 
3< 0.1%
 
2< 0.1%
 
2< 0.1%
 
1< 0.1%
 

Most frequent Misc Symbols characters

ValueCountFrequency (%) 
25410.4%
 
1737.1%
 
1717.0%
 
1717.0%
 
1395.7%
 
1094.5%
 
1024.2%
 
984.0%
 
743.0%
 
692.8%
 
662.7%
 
492.0%
 
431.8%
 
371.5%
 
331.4%
 
321.3%
 
311.3%
 
301.2%
 
291.2%
 
281.1%
 
271.1%
 
271.1%
 
241.0%
 
241.0%
 
230.9%
 
Other values (80)57623.6%
 

Most frequent VS characters

ValueCountFrequency (%) 
388898.7%
 
511.3%
 

Most frequent Dingbats characters

ValueCountFrequency (%) 
84037.4%
 
2169.6%
 
1787.9%
 
1777.9%
 
1546.9%
 
1235.5%
 
1225.4%
 
853.8%
 
592.6%
 
522.3%
 
462.0%
 
291.3%
 
200.9%
 
170.8%
 
150.7%
 
140.6%
 
110.5%
 
90.4%
 
90.4%
 
80.4%
 
60.3%
 
60.3%
 
50.2%
 
40.2%
 
40.2%
 
Other values (22)391.7%
 

Most frequent Diacriticals characters

ValueCountFrequency (%) 
͟48187.3%
 
̶132.4%
 
̞101.8%
 
̈81.5%
 
͌40.7%
 
̽40.7%
 
́20.4%
 
͛20.4%
 
͖20.4%
 
̆20.4%
 
̂20.4%
 
͚20.4%
 
̟20.4%
 
͎20.4%
 
̾20.4%
 
͜20.4%
 
͡20.4%
 
̵20.4%
 
̅10.2%
 
̡10.2%
 
̨10.2%
 
̄10.2%
 
ͥ10.2%
 
ͣ10.2%
 
ͫ10.2%
 

Most frequent Enclosed Alphanum Sup characters

ValueCountFrequency (%) 
🇺114412.3%
 
🇮94010.1%
 
🇳8659.3%
 
🇸8178.8%
 
🇪6637.2%
 
🇦5335.8%
 
🇧4895.3%
 
🇨4695.1%
 
🇬4474.8%
 
🇷4284.6%
 
🇵3253.5%
 
🇰2913.1%
 
🇹2552.8%
 
🇭2512.7%
 
🇲2462.7%
 
🇱2252.4%
 
🇩1481.6%
 
🇿1311.4%
 
🇫1041.1%
 
🇴750.8%
 
🇼740.8%
 
🇾680.7%
 
🇻580.6%
 
🇽510.6%
 
🇯440.5%
 
Other values (43)1281.4%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
é54922.2%
 
í2118.5%
 
 1716.9%
 
ó1656.7%
 
á1506.1%
 
®1486.0%
 
ñ1194.8%
 
ä853.4%
 
ü682.8%
 
è602.4%
 
à582.3%
 
­522.1%
 
ö461.9%
 
¦451.8%
 
ç441.8%
 
°401.6%
 
ê381.5%
 
ú341.4%
 
¡281.1%
 
Ü230.9%
 
©220.9%
 
·220.9%
 
â200.8%
 
Ö200.8%
 
«160.6%
 
Other values (47)2369.6%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا99514.4%
 
ل6599.6%
 
ي4266.2%
 
م4025.8%
 
ن3815.5%
 
ر3765.5%
 
و3655.3%
 
ت2563.7%
 
ب2233.2%
 
ع2173.1%
 
س2143.1%
 
د2143.1%
 
ة1912.8%
 
ف1271.8%
 
ح1261.8%
 
ه1241.8%
 
ق1111.6%
 
ك1081.6%
 
ی1061.5%
 
ش811.2%
 
خ801.2%
 
َ741.1%
 
ج711.0%
 
أ711.0%
 
ط691.0%
 
Other values (41)82912.0%
 

Most frequent Geometric Shapes characters

ValueCountFrequency (%) 
13732.4%
 
13431.7%
 
7317.3%
 
143.3%
 
122.8%
 
112.6%
 
102.4%
 
71.7%
 
61.4%
 
40.9%
 
40.9%
 
40.9%
 
30.7%
 
10.2%
 
10.2%
 
10.2%
 
10.2%
 

Most frequent Tags characters

ValueCountFrequency (%) 
󠁧22622.3%
 
󠁢17016.7%
 
󠁿16816.6%
 
󠁳11311.1%
 
󠁣717.0%
 
󠁴706.9%
 
󠁥565.5%
 
󠁮565.5%
 
󠁷434.2%
 
󠁬424.1%
 

Most frequent Emoticons characters

ValueCountFrequency (%) 
🙏37924.9%
 
😷16210.6%
 
😎1419.3%
 
😊905.9%
 
😍624.1%
 
😉543.5%
 
😇533.5%
 
🙌453.0%
 
😂442.9%
 
😀432.8%
 
😁412.7%
 
🙂291.9%
 
😜281.8%
 
🙃231.5%
 
😘201.3%
 
😏201.3%
 
😸191.2%
 
😄191.2%
 
🙈171.1%
 
😈161.0%
 
😠151.0%
 
😻140.9%
 
😅130.9%
 
😺100.7%
 
😋100.7%
 
Other values (40)15710.3%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
20618.3%
 
19027.6%
 
18867.6%
 
12114.9%
 
11864.8%
 
11734.7%
 
ि11084.4%
 
10784.3%
 
9473.8%
 
9033.6%
 
8583.4%
 
7413.0%
 
7092.8%
 
6982.8%
 
6822.7%
 
6242.5%
 
5792.3%
 
4781.9%
 
4641.9%
 
4361.7%
 
4261.7%
 
3781.5%
 
3541.4%
 
3411.4%
 
3261.3%
 
Other values (49)337813.6%
 

Most frequent Math Operators characters

ValueCountFrequency (%) 
44789.2%
 
142.8%
 
132.6%
 
61.2%
 
61.2%
 
51.0%
 
30.6%
 
20.4%
 
20.4%
 
10.2%
 
10.2%
 
10.2%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
и5329.3%
 
с4738.2%
 
о4437.7%
 
а4227.3%
 
е3335.8%
 
н3145.5%
 
р2524.4%
 
т2494.3%
 
к2424.2%
 
л2203.8%
 
в2123.7%
 
у1532.7%
 
д1522.6%
 
м1262.2%
 
ь1202.1%
 
я1111.9%
 
п971.7%
 
з961.7%
 
й951.7%
 
П891.5%
 
ы871.5%
 
х821.4%
 
Р721.3%
 
ю691.2%
 
ц611.1%
 
Other values (42)64511.2%
 

Most frequent Math Alphanum characters

ValueCountFrequency (%) 
𝐞732.6%
 
𝕖662.4%
 
𝐧521.9%
 
𝐢481.7%
 
𝐚471.7%
 
𝕣461.6%
 
𝕒451.6%
 
𝕥441.6%
 
𝐭441.6%
 
𝐫421.5%
 
𝓉381.4%
 
𝒾371.3%
 
𝐨351.2%
 
𝕠321.1%
 
𝐬311.1%
 
𝕚301.1%
 
𝕤291.0%
 
𝑒281.0%
 
𝒆281.0%
 
𝕟240.9%
 
𝒕240.9%
 
𝓮240.9%
 
𝓃230.8%
 
𝐡230.8%
 
𝒂220.8%
 
Other values (350)187266.7%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
8615.1%
 
539.3%
 
335.8%
 
295.1%
 
284.9%
 
284.9%
 
274.8%
 
203.5%
 
183.2%
 
152.6%
 
142.5%
 
142.5%
 
142.5%
 
132.3%
 
132.3%
 
122.1%
 
122.1%
 
111.9%
 
111.9%
 
81.4%
 
71.2%
 
71.2%
 
71.2%
 
61.1%
 
61.1%
 
Other values (29)7613.4%
 

Most frequent Misc Technical characters

ValueCountFrequency (%) 
2147.7%
 
1329.5%
 
36.8%
 
24.5%
 
24.5%
 
24.5%
 
12.3%
 

Most frequent Latin Ext A characters

ValueCountFrequency (%) 
ı4820.3%
 
ř2611.0%
 
š2510.6%
 
İ187.6%
 
ě156.4%
 
ā145.9%
 
ğ83.4%
 
ł83.4%
 
ş62.5%
 
ż62.5%
 
ą62.5%
 
ę62.5%
 
đ62.5%
 
ů52.1%
 
ō52.1%
 
č52.1%
 
ī41.7%
 
ž41.7%
 
ă41.7%
 
ś41.7%
 
ć20.8%
 
ū20.8%
 
ő10.4%
 
ľ10.4%
 
Č10.4%
 
Other values (6)62.5%
 

Most frequent CJK characters

ValueCountFrequency (%) 
601.9%
 
581.8%
 
571.8%
 
561.7%
 
531.6%
 
491.5%
 
441.4%
 
421.3%
 
391.2%
 
341.1%
 
331.0%
 
311.0%
 
300.9%
 
290.9%
 
290.9%
 
290.9%
 
270.8%
 
270.8%
 
260.8%
 
250.8%
 
250.8%
 
250.8%
 
240.7%
 
240.7%
 
240.7%
 
Other values (547)233572.2%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
567.7%
 
466.3%
 
456.1%
 
446.0%
 
405.5%
 
405.5%
 
395.3%
 
354.8%
 
344.6%
 
324.4%
 
294.0%
 
223.0%
 
223.0%
 
202.7%
 
192.6%
 
172.3%
 
162.2%
 
152.0%
 
141.9%
 
141.9%
 
111.5%
 
111.5%
 
111.5%
 
81.1%
 
81.1%
 
Other values (26)8411.5%
 

Most frequent Phonetic Ext characters

ValueCountFrequency (%) 
12416.1%
 
7810.1%
 
759.7%
 
709.1%
 
425.4%
 
415.3%
 
384.9%
 
344.4%
 
344.4%
 
314.0%
 
263.4%
 
263.4%
 
182.3%
 
172.2%
 
162.1%
 
151.9%
 
151.9%
 
131.7%
 
101.3%
 
70.9%
 
60.8%
 
50.6%
 
40.5%
 
40.5%
 
30.4%
 
Other values (14)192.5%
 

Most frequent Phonetic Ext Sup characters

ValueCountFrequency (%) 
1250.0%
 
937.5%
 
28.3%
 
14.2%
 

Most frequent Modifier Letters characters

ValueCountFrequency (%) 
ʳ1925.3%
 
ˢ1925.3%
 
ʰ1418.7%
 
ˡ1216.0%
 
ʲ45.3%
 
ʸ22.7%
 
ː22.7%
 
ʷ22.7%
 
ʻ11.3%
 

Most frequent Enclosed Alphanum characters

ValueCountFrequency (%) 
1416.3%
 
910.5%
 
78.1%
 
67.0%
 
67.0%
 
44.7%
 
44.7%
 
44.7%
 
33.5%
 
33.5%
 
33.5%
 
33.5%
 
33.5%
 
33.5%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 

Most frequent Letterlike Symbols characters

ValueCountFrequency (%) 
3745.7%
 
1721.0%
 
78.6%
 
56.2%
 
33.7%
 
33.7%
 
22.5%
 
22.5%
 
22.5%
 
11.2%
 
11.2%
 
11.2%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
33.5%
 
33.5%
 
33.5%
 
22.4%
 
22.4%
 
22.4%
 
22.4%
 
22.4%
 
22.4%
 
22.4%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
11.2%
 
Other values (47)4755.3%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
28516.8%
 
1649.7%
 
ி1508.8%
 
1418.3%
 
975.7%
 
925.4%
 
764.5%
 
734.3%
 
694.1%
 
653.8%
 
482.8%
 
452.7%
 
442.6%
 
442.6%
 
362.1%
 
331.9%
 
331.9%
 
291.7%
 
271.6%
 
241.4%
 
171.0%
 
140.8%
 
140.8%
 
120.7%
 
110.6%
 
Other values (14)533.1%
 

Most frequent PUA characters

ValueCountFrequency (%) 
1976.0%
 
312.0%
 
312.0%
 

Most frequent Braille characters

ValueCountFrequency (%) 
40100.0%
 

Most frequent IPA Ext characters

ValueCountFrequency (%) 
ɪ8520.0%
 
ʀ7818.4%
 
ɴ7016.5%
 
ʟ5813.7%
 
ʜ286.6%
 
ʏ235.4%
 
ɢ215.0%
 
ʙ194.5%
 
ʇ71.7%
 
ɹ51.2%
 
ɟ40.9%
 
ɔ30.7%
 
ʌ30.7%
 
ʎ30.7%
 
ɥ30.7%
 
ɾ30.7%
 
ɑ20.5%
 
ʃ10.2%
 
ɐ10.2%
 
ɯ10.2%
 
ɱ10.2%
 
ɛ10.2%
 
ɫ10.2%
 
ʋ10.2%
 
ʂ10.2%
 

Most frequent Latin Ext D characters

ValueCountFrequency (%) 
4272.4%
 
1424.1%
 
23.4%
 

Most frequent Arrows characters

ValueCountFrequency (%) 
1331.7%
 
717.1%
 
37.3%
 
37.3%
 
24.9%
 
24.9%
 
24.9%
 
24.9%
 
24.9%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 
12.4%
 

Most frequent Egyptian Hieroglyphs characters

ValueCountFrequency (%) 
𓅓133.3%
 
𓆉133.3%
 
𓇽133.3%
 

Most frequent Old Turkic characters

ValueCountFrequency (%) 
𐰀315.8%
 
𐰤210.5%
 
𐰢210.5%
 
𐰆210.5%
 
𐰇210.5%
 
𐱃15.3%
 
𐰞15.3%
 
𐱅15.3%
 
𐰼15.3%
 
𐰚15.3%
 
𐰓15.3%
 
𐰃15.3%
 
𐰘15.3%
 

Most frequent Oriya characters

ValueCountFrequency (%) 
ି109.2%
 
87.3%
 
76.4%
 
76.4%
 
65.5%
 
65.5%
 
43.7%
 
43.7%
 
43.7%
 
43.7%
 
43.7%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
32.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
21.8%
 
Other values (11)1110.1%
 

Most frequent Currency Symbols characters

ValueCountFrequency (%) 
2187.5%
 
28.3%
 
14.2%
 

Most frequent Box Drawing characters

ValueCountFrequency (%) 
4477.2%
 
58.8%
 
35.3%
 
23.5%
 
23.5%
 
11.8%
 

Most frequent Bengali characters

ValueCountFrequency (%) 
7914.5%
 
ি397.1%
 
376.8%
 
346.2%
 
254.6%
 
254.6%
 
224.0%
 
193.5%
 
183.3%
 
183.3%
 
183.3%
 
142.6%
 
142.6%
 
142.6%
 
132.4%
 
122.2%
 
112.0%
 
112.0%
 
101.8%
 
91.6%
 
91.6%
 
81.5%
 
81.5%
 
71.3%
 
71.3%
 
Other values (20)6511.9%
 

Most frequent Cuneiform characters

ValueCountFrequency (%) 
𒀭2100.0%
 

Most frequent Sinhala characters

ValueCountFrequency (%) 
1211.8%
 
87.8%
 
76.9%
 
65.9%
 
65.9%
 
65.9%
 
54.9%
 
54.9%
 
43.9%
 
43.9%
 
43.9%
 
32.9%
 
32.9%
 
32.9%
 
32.9%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
Other values (7)76.9%
 

Most frequent Thai characters

ValueCountFrequency (%) 
397.2%
 
376.8%
 
325.9%
 
285.1%
 
244.4%
 
213.9%
 
203.7%
 
183.3%
 
183.3%
 
183.3%
 
173.1%
 
162.9%
 
142.6%
 
142.6%
 
122.2%
 
122.2%
 
122.2%
 
112.0%
 
112.0%
 
112.0%
 
112.0%
 
112.0%
 
101.8%
 
91.7%
 
91.7%
 
Other values (28)11020.2%
 

Most frequent Sup Arrows B characters

ValueCountFrequency (%) 
13100.0%
 

Most frequent Number Forms characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Greek Ext characters

ValueCountFrequency (%) 
323.1%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 
17.7%
 

Most frequent Block Elements characters

ValueCountFrequency (%) 
1446.7%
 
1343.3%
 
310.0%
 

Most frequent Latin Ext B characters

ValueCountFrequency (%) 
ǝ1236.4%
 
ș618.2%
 
ț39.1%
 
Ƹ39.1%
 
Ʒ39.1%
 
Ɔ26.1%
 
ư26.1%
 
ƈ13.0%
 
ƒ13.0%
 

Most frequent Playing Cards characters

ValueCountFrequency (%) 
🃏3100.0%
 

Most frequent Specials characters

ValueCountFrequency (%) 
675.0%
 
225.0%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
1079.1%
 
968.2%
 
ಿ907.6%
 
806.8%
 
806.8%
 
685.8%
 
534.5%
 
504.2%
 
484.1%
 
443.7%
 
433.7%
 
413.5%
 
373.1%
 
282.4%
 
272.3%
 
272.3%
 
252.1%
 
232.0%
 
211.8%
 
191.6%
 
191.6%
 
171.4%
 
141.2%
 
141.2%
 
131.1%
 
Other values (18)937.9%
 

Most frequent Hebrew characters

ValueCountFrequency (%) 
א2211.1%
 
ו189.1%
 
י168.1%
 
ל168.1%
 
ר157.6%
 
ש136.6%
 
ת94.5%
 
ה84.0%
 
ְ84.0%
 
ָ84.0%
 
ע63.0%
 
מ63.0%
 
ב63.0%
 
ד52.5%
 
ג52.5%
 
ֶ52.5%
 
ך42.0%
 
כ31.5%
 
ם31.5%
 
ּ31.5%
 
ַ31.5%
 
נ21.0%
 
ח21.0%
 
צ21.0%
 
ף21.0%
 
Other values (6)84.0%
 

Most frequent Geometric Shapes Ext characters

ValueCountFrequency (%) 
🟢627.3%
 
🟥522.7%
 
🟩418.2%
 
🟨313.6%
 
🟧29.1%
 
🟡14.5%
 
🟣14.5%
 

Most frequent Gurmukhi characters

ValueCountFrequency (%) 
2010.3%
 
168.2%
 
147.2%
 
94.6%
 
94.6%
 
84.1%
 
73.6%
 
73.6%
 
73.6%
 
73.6%
 
73.6%
 
63.1%
 
63.1%
 
63.1%
 
52.6%
 
ਿ52.6%
 
52.6%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
42.1%
 
21.0%
 
Other values (17)2110.8%
 

Most frequent Georgian characters

ValueCountFrequency (%) 
770.0%
 
220.0%
 
110.0%
 

Most frequent Gujarati characters

ValueCountFrequency (%) 
1212.4%
 
77.2%
 
77.2%
 
66.2%
 
66.2%
 
66.2%
 
55.2%
 
55.2%
 
55.2%
 
44.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
િ11.0%
 
11.0%
 
Other values (5)55.2%
 

Most frequent Malayalam characters

ValueCountFrequency (%) 
1412.1%
 
108.6%
 
108.6%
 
97.8%
 
ി76.0%
 
65.2%
 
54.3%
 
43.4%
 
43.4%
 
43.4%
 
43.4%
 
32.6%
 
32.6%
 
32.6%
 
32.6%
 
32.6%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
10.9%
 
10.9%
 
10.9%
 
Other values (9)97.8%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
528.7%
 
477.9%
 
376.2%
 
ి366.0%
 
345.7%
 
315.2%
 
315.2%
 
284.7%
 
274.5%
 
274.5%
 
203.4%
 
193.2%
 
152.5%
 
152.5%
 
152.5%
 
152.5%
 
132.2%
 
132.2%
 
122.0%
 
111.8%
 
101.7%
 
101.7%
 
91.5%
 
91.5%
 
81.3%
 
Other values (21)528.7%
 

Most frequent Tibetan characters

ValueCountFrequency (%) 
1477.8%
 
422.2%
 

Most frequent Tagalog characters

ValueCountFrequency (%) 
125.0%
 
125.0%
 
125.0%
 
125.0%
 

Most frequent UCAS characters

ValueCountFrequency (%) 
2316.1%
 
1711.9%
 
1611.2%
 
1510.5%
 
149.8%
 
107.0%
 
96.3%
 
85.6%
 
64.2%
 
53.5%
 
42.8%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 

Most frequent Coptic characters

ValueCountFrequency (%) 
228.6%
 
228.6%
 
228.6%
 
114.3%
 

Most frequent Javanese characters

ValueCountFrequency (%) 
250.0%
 
250.0%
 

Most frequent Ol Chiki characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Cherokee characters

ValueCountFrequency (%) 
240.0%
 
120.0%
 
120.0%
 
120.0%
 

Most frequent Arabic PF A characters

ValueCountFrequency (%) 
240.0%
 
﴿120.0%
 
120.0%
 
120.0%
 

Most frequent Latin Ext Additional characters

ValueCountFrequency (%) 
315.0%
 
315.0%
 
210.0%
 
210.0%
 
210.0%
 
210.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 

Most frequent Sup Punctuation characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Misc Math Symbols A characters

ValueCountFrequency (%) 
450.0%
 
450.0%
 

Most frequent Armenian characters

ValueCountFrequency (%) 
ա823.5%
 
կ411.8%
 
ե38.8%
 
տ25.9%
 
ն25.9%
 
ր25.9%
 
Ք12.9%
 
ո12.9%
 
ղ12.9%
 
վ12.9%
 
զ12.9%
 
ը12.9%
 
֍12.9%
 
֎12.9%
 
Փ12.9%
 
ս12.9%
 
չ12.9%
 
հ12.9%
 
ի12.9%
 

Most frequent Misc Math Symbols B characters

ValueCountFrequency (%) 
266.7%
 
133.3%
 

Most frequent Runic characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Sup PUA A characters

ValueCountFrequency (%) 
󾓩2100.0%
 

Most frequent Tifinagh characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Ethiopic characters

ValueCountFrequency (%) 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Compat Jamo characters

ValueCountFrequency (%) 
2100.0%
 

Most frequent Lao characters

ValueCountFrequency (%) 
150.0%
 
150.0%
 

Most frequent Yi Radicals characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Bamum Sup characters

ValueCountFrequency (%) 
𖥻1100.0%
 

user_created
Categorical

HIGH CARDINALITY

Distinct25871
Distinct (%)56.2%
Missing0
Missing (%)0.0%
Memory size360.0 KiB
2010-09-20 17:01:08
 
1026
2009-04-22 12:55:28
 
283
2020-05-21 15:54:09
 
246
2019-12-31 06:11:12
 
184
2020-08-11 09:12:38
 
170
Other values (25866)
44150 
ValueCountFrequency (%) 
2010-09-20 17:01:0810262.2%
 
2009-04-22 12:55:282830.6%
 
2020-05-21 15:54:092460.5%
 
2019-12-31 06:11:121840.4%
 
2020-08-11 09:12:381700.4%
 
2015-05-22 08:31:121350.3%
 
2019-03-25 17:58:431320.3%
 
2009-03-16 03:03:131260.3%
 
2020-09-17 18:16:071210.3%
 
2012-05-23 02:53:471190.3%
 
2011-05-23 15:00:261100.2%
 
2009-07-09 09:04:01910.2%
 
2015-01-02 14:13:17900.2%
 
2017-02-20 08:41:38880.2%
 
2013-01-24 03:18:59790.2%
 
2009-07-25 08:41:05730.2%
 
2015-01-16 04:52:01680.1%
 
2019-08-22 13:21:22680.1%
 
2018-02-04 12:36:42650.1%
 
2009-08-11 06:12:45570.1%
 
2017-12-29 11:04:46560.1%
 
2011-07-20 00:59:59550.1%
 
2019-10-19 12:24:33550.1%
 
2010-05-08 13:21:45550.1%
 
2013-07-19 11:26:39550.1%
 
Other values (25846)4245292.2%
 
2021-05-15T20:26:52.105082image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique20528 ?
Unique (%)44.6%
2021-05-15T20:26:52.385942image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length19
Median length19
Mean length19
Min length19

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories4 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
017063619.5%
 
112717614.5%
 
212073913.8%
 
-9211810.5%
 
:9211810.5%
 
460595.3%
 
3435275.0%
 
5391384.5%
 
4383164.4%
 
9325703.7%
 
8257822.9%
 
7239222.7%
 
6230202.6%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number64482673.7%
 
Dash Punctuation9211810.5%
 
Other Punctuation9211810.5%
 
Space Separator460595.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
017063626.5%
 
112717619.7%
 
212073918.7%
 
3435276.8%
 
5391386.1%
 
4383165.9%
 
9325705.1%
 
8257824.0%
 
7239223.7%
 
6230203.6%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-92118100.0%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
46059100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
:92118100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common875121100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
017063619.5%
 
112717614.5%
 
212073913.8%
 
-9211810.5%
 
:9211810.5%
 
460595.3%
 
3435275.0%
 
5391384.5%
 
4383164.4%
 
9325703.7%
 
8257822.9%
 
7239222.7%
 
6230202.6%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII875121100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
017063619.5%
 
112717614.5%
 
212073913.8%
 
-9211810.5%
 
:9211810.5%
 
460595.3%
 
3435275.0%
 
5391384.5%
 
4383164.4%
 
9325703.7%
 
8257822.9%
 
7239222.7%
 
6230202.6%
 

user_followers
Real number (ℝ≥0)

Distinct9752
Distinct (%)21.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103497.3731
Minimum0
Maximum14919786
Zeros335
Zeros (%)0.7%
Memory size360.0 KiB
2021-05-15T20:26:52.705289image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10
Q1123
median584
Q32705.5
95-th percentile95576.7
Maximum14919786
Range14919786
Interquartile range (IQR)2582.5

Descriptive statistics

Standard deviation854781.0032
Coefficient of variation (CV)8.258963275
Kurtosis173.9796234
Mean103497.3731
Median Absolute Deviation (MAD)556
Skewness12.42369974
Sum4766985506
Variance7.306505634e+11
MonotocityNot monotonic
2021-05-15T20:26:53.054966image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
03350.7%
 
13030.7%
 
32380.5%
 
11902320.5%
 
42180.5%
 
102180.5%
 
22100.5%
 
72100.5%
 
61930.4%
 
161880.4%
 
121880.4%
 
51660.4%
 
81630.4%
 
131580.3%
 
11791550.3%
 
141430.3%
 
91420.3%
 
11771420.3%
 
151280.3%
 
111260.3%
 
171250.3%
 
261160.3%
 
501140.2%
 
271120.2%
 
221120.2%
 
Other values (9727)4162490.4%
 
ValueCountFrequency (%) 
03350.7%
 
13030.7%
 
22100.5%
 
32380.5%
 
42180.5%
 
51660.4%
 
61930.4%
 
72100.5%
 
81630.4%
 
91420.3%
 
ValueCountFrequency (%) 
149197862< 0.1%
 
148794951< 0.1%
 
148794931< 0.1%
 
148730252< 0.1%
 
148595971< 0.1%
 
148567423< 0.1%
 
148567403< 0.1%
 
148383571< 0.1%
 
148241391< 0.1%
 
148118501< 0.1%
 

user_friends
Real number (ℝ≥0)

SKEWED

Distinct5199
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1334.568011
Minimum0
Maximum380428
Zeros397
Zeros (%)0.9%
Memory size360.0 KiB
2021-05-15T20:26:53.474952image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile14
Q1148
median425
Q31222
95-th percentile4870.1
Maximum380428
Range380428
Interquartile range (IQR)1074

Descriptive statistics

Standard deviation5998.529071
Coefficient of variation (CV)4.494734644
Kurtosis2033.415311
Mean1334.568011
Median Absolute Deviation (MAD)352
Skewness37.72401569
Sum61468868
Variance35982351.02
MonotocityNot monotonic
2021-05-15T20:26:53.851081image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
03970.9%
 
13070.7%
 
3062630.6%
 
2062590.6%
 
62530.5%
 
1962240.5%
 
32170.5%
 
1971790.4%
 
21770.4%
 
1411550.3%
 
101500.3%
 
1421470.3%
 
451460.3%
 
71450.3%
 
251430.3%
 
701370.3%
 
1441230.3%
 
221200.3%
 
281200.3%
 
50011130.2%
 
381110.2%
 
211070.2%
 
171060.2%
 
261050.2%
 
1071050.2%
 
Other values (5174)4175090.6%
 
ValueCountFrequency (%) 
03970.9%
 
13070.7%
 
21770.4%
 
32170.5%
 
4860.2%
 
5930.2%
 
62530.5%
 
71450.3%
 
8990.2%
 
9970.2%
 
ValueCountFrequency (%) 
3804281< 0.1%
 
3803622< 0.1%
 
3803531< 0.1%
 
3802651< 0.1%
 
2747181< 0.1%
 
2738121< 0.1%
 
1952891< 0.1%
 
1498131< 0.1%
 
1497231< 0.1%
 
1496991< 0.1%
 

user_favourites
Real number (ℝ≥0)

ZEROS

Distinct16866
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15462.41948
Minimum0
Maximum1205878
Zeros671
Zeros (%)1.5%
Memory size360.0 KiB
2021-05-15T20:26:54.222303image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile13
Q1379
median2225
Q311555.5
95-th percentile71728
Maximum1205878
Range1205878
Interquartile range (IQR)11176.5

Descriptive statistics

Standard deviation42933.05137
Coefficient of variation (CV)2.776606301
Kurtosis109.6825323
Mean15462.41948
Median Absolute Deviation (MAD)2185
Skewness8.244921903
Sum712183579
Variance1843246900
MonotocityNot monotonic
2021-05-15T20:26:54.699771image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
06711.5%
 
32880.6%
 
12370.5%
 
18372290.5%
 
242080.5%
 
251770.4%
 
5351610.3%
 
21550.3%
 
16921510.3%
 
41480.3%
 
51430.3%
 
141130.2%
 
10631130.2%
 
71120.2%
 
101110.2%
 
341090.2%
 
61080.2%
 
151030.2%
 
13960.2%
 
33940.2%
 
19900.2%
 
17880.2%
 
11870.2%
 
502810.2%
 
32810.2%
 
Other values (16841)4210591.4%
 
ValueCountFrequency (%) 
06711.5%
 
12370.5%
 
21550.3%
 
32880.6%
 
41480.3%
 
51430.3%
 
61080.2%
 
71120.2%
 
8750.2%
 
9770.2%
 
ValueCountFrequency (%) 
12058781< 0.1%
 
9482461< 0.1%
 
9479011< 0.1%
 
9461182< 0.1%
 
9246671< 0.1%
 
8869351< 0.1%
 
8709931< 0.1%
 
8503371< 0.1%
 
7774621< 0.1%
 
7737401< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size45.1 KiB
False
40999 
True
5060 
ValueCountFrequency (%) 
False4099989.0%
 
True506011.0%
 
2021-05-15T20:26:55.122540image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

date
Categorical

HIGH CARDINALITY
UNIFORM

Distinct45622
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size360.0 KiB
2021-03-02 23:02:10
 
4
2021-02-09 07:30:00
 
3
2021-03-30 01:30:00
 
3
2021-02-24 08:30:00
 
3
2021-02-15 00:00:56
 
3
Other values (45617)
46043 
ValueCountFrequency (%) 
2021-03-02 23:02:104< 0.1%
 
2021-02-09 07:30:003< 0.1%
 
2021-03-30 01:30:003< 0.1%
 
2021-02-24 08:30:003< 0.1%
 
2021-02-15 00:00:563< 0.1%
 
2021-03-02 17:50:243< 0.1%
 
2021-03-01 04:52:103< 0.1%
 
2021-02-13 00:30:003< 0.1%
 
2021-03-01 06:37:023< 0.1%
 
2021-03-02 23:02:083< 0.1%
 
2021-03-02 05:30:003< 0.1%
 
2021-03-01 03:09:313< 0.1%
 
2021-03-31 21:25:032< 0.1%
 
2021-03-26 09:52:482< 0.1%
 
2021-03-29 23:00:012< 0.1%
 
2021-03-10 06:50:212< 0.1%
 
2021-03-04 09:12:562< 0.1%
 
2021-02-28 10:57:002< 0.1%
 
2021-03-01 04:26:592< 0.1%
 
2021-03-16 08:25:402< 0.1%
 
2021-02-07 10:02:382< 0.1%
 
2021-02-09 08:41:262< 0.1%
 
2021-03-09 13:15:002< 0.1%
 
2021-04-01 14:30:002< 0.1%
 
2021-03-31 06:16:592< 0.1%
 
Other values (45597)4599699.9%
 
2021-05-15T20:26:55.824222image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique45198 ?
Unique (%)98.1%
2021-05-15T20:26:56.177461image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length19
Median length19
Mean length19
Min length19

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories4 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
016123818.4%
 
215881818.1%
 
112506114.3%
 
-9211810.5%
 
:9211810.5%
 
3640357.3%
 
460595.3%
 
4350274.0%
 
5323723.7%
 
6175302.0%
 
9170772.0%
 
8169811.9%
 
7166871.9%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number64482673.7%
 
Dash Punctuation9211810.5%
 
Other Punctuation9211810.5%
 
Space Separator460595.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
016123825.0%
 
215881824.6%
 
112506119.4%
 
3640359.9%
 
4350275.4%
 
5323725.0%
 
6175302.7%
 
9170772.6%
 
8169812.6%
 
7166872.6%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-92118100.0%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
46059100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
:92118100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common875121100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
016123818.4%
 
215881818.1%
 
112506114.3%
 
-9211810.5%
 
:9211810.5%
 
3640357.3%
 
460595.3%
 
4350274.0%
 
5323723.7%
 
6175302.0%
 
9170772.0%
 
8169811.9%
 
7166871.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII875121100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
016123818.4%
 
215881818.1%
 
112506114.3%
 
-9211810.5%
 
:9211810.5%
 
3640357.3%
 
460595.3%
 
4350274.0%
 
5323723.7%
 
6175302.0%
 
9170772.0%
 
8169811.9%
 
7166871.9%
 

text
Categorical

HIGH CARDINALITY
UNIFORM

Distinct46018
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size360.0 KiB
@POTUS What about #Covaxin from #Ocugen ?! It seems like it's better than anything we have now! Is it coming to the US? Why or why not??
 
5
#Covid19 Vaccine Rollout Needs Spark Even More Innovation https://t.co/EBaYqJexm1 #Ergotron #VaccinationCart #Pfizer #PfizerBioNTech
 
5
@Reuters Do you know the meaning of “V” in the name of the first Russian vaccine- #SputnikV? “V” for Victory. Victory over the #pandemic.
 
5
@sputnikvaccine Not even a majority of all Russians are willing to take this vaccin! #SputnikV @sputnikvaccine.
 
3
AIIMS at which PM Modi took #Covaxin was built by Nehru.
 
3
Other values (46013)
46038 
ValueCountFrequency (%) 
@POTUS What about #Covaxin from #Ocugen ?! It seems like it's better than anything we have now! Is it coming to the US? Why or why not??5< 0.1%
 
#Covid19 Vaccine Rollout Needs Spark Even More Innovation https://t.co/EBaYqJexm1 #Ergotron #VaccinationCart #Pfizer #PfizerBioNTech5< 0.1%
 
@Reuters Do you know the meaning of “V” in the name of the first Russian vaccine- #SputnikV? “V” for Victory. Victory over the #pandemic.5< 0.1%
 
@sputnikvaccine Not even a majority of all Russians are willing to take this vaccin! #SputnikV @sputnikvaccine.3< 0.1%
 
AIIMS at which PM Modi took #Covaxin was built by Nehru.3< 0.1%
 
#Moderna Post jobs for free on https://t.co/Jxbtzryhtg2< 0.1%
 
भिखारी Pakistan to receive ‘Made in India’ COVID-19 vaccines from GAVI #COVID19Vaccine #Covaxin #CoronaVirusUpdates2< 0.1%
 
@visshnumittal Is there a quality difference in #Covaxin &amp; #CovishieldVaccine ???2< 0.1%
 
So yesterday was rough - fatigue headache and chills. #PfizerBioNTech Better today so far.2< 0.1%
 
@WHO @DrTedros Dr faucii = Also know as “the NEW angel of death” #modernA version of #JosefMengele2< 0.1%
 
Russia in talks with several Austrian companies on #SputnikV production, RDIF CEO says @sputnikvaccine https://t.co/lp4X9OgURa2< 0.1%
 
DOH: Some providers gave out 2nd doses of #Moderna vaccine as 1st doses in mishap https://t.co/9ohs4noIPN2< 0.1%
 
@sputnikvaccine @sputnikvaccine #SputnikV doesn't works, it's just a fraud 👇 https://t.co/FuXlfHHxqa2< 0.1%
 
Vaccinated and ready to dominate the world once again!😊💯💯 https://t.co/9kFWzG4fIE #dubai #dha @DHA_Dubai #vaccine #PfizerBioNTech2< 0.1%
 
PMO India says PM took the first dose of Bharat Biotech's #Covaxin Bharat Biotech2< 0.1%
 
24 hours after the 2nd #Moderna shot and, honestly, I feel like runny baby poo. Definitely on auto-pilot today &amp; tomorrow.2< 0.1%
 
'Dr Reddy's expects #SputnikV vaccine to get approval from Indian regulator in next few weeks' https://t.co/sXHXveqOz92< 0.1%
 
@WHO @DrTedros 500 deaths today in Italy. Shame to everyone! #SputnikV could save them.2< 0.1%
 
test #covaxin2< 0.1%
 
Afghanistan and Russia to discuss #SputnikV vaccine supplies soon, foreign minister Says @sputnikvaccine https://t.co/tfCtOaFC7M2< 0.1%
 
1st dose of #vaccine just now #OxfordAstraZeneca Woohoo 😃2< 0.1%
 
Russia, Turkey in talks on joint production of #SputnikV #COVID19Vaccine, ambassador says @sputnikvaccine https://t.co/adLhfJS5es2< 0.1%
 
Vaccinated! #Moderna2< 0.1%
 
@idnani_nandini #SastaBhiKargarBhi #Covaxin of Bharat Biotech2< 0.1%
 
PHARMACY and Poisons Board confirms approval for emergency use of Russia’s #SputnikV COVID vaccine in Kenya after tests.2< 0.1%
 
Other values (45993)4599899.9%
 
2021-05-15T20:26:56.966969image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique45988 ?
Unique (%)99.8%
2021-05-15T20:26:57.488263image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length156
Median length139
Mean length126.1271413
Min length13

Overview of Unicode Properties

Unique unicode characters1400
Unique unicode categories23 ?
Unique unicode scripts19 ?
Unique unicode blocks45 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
72119812.4%
 
e3900686.7%
 
t3860856.6%
 
a3235255.6%
 
o3231585.6%
 
i2921415.0%
 
n2745574.7%
 
s2512184.3%
 
c2096183.6%
 
r2080953.6%
 
h1814813.1%
 
d1455072.5%
 
/1283742.2%
 
l1123961.9%
 
p1115861.9%
 
u908951.6%
 
#863891.5%
 
f837611.4%
 
v835271.4%
 
m774301.3%
 
.733241.3%
 
y681421.2%
 
g635011.1%
 
C471920.8%
 
w470510.8%
 
Other values (1375)102907117.7%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter386759566.6%
 
Space Separator72142812.4%
 
Uppercase Letter5675699.8%
 
Other Punctuation4408837.6%
 
Decimal Number1390702.4%
 
Control282180.5%
 
Other Symbol130410.2%
 
Dash Punctuation97030.2%
 
Final Punctuation60340.1%
 
Connector Punctuation37230.1%
 
Open Punctuation2043< 0.1%
 
Close Punctuation1834< 0.1%
 
Other Letter1519< 0.1%
 
Nonspacing Mark1366< 0.1%
 
Currency Symbol1229< 0.1%
 
Math Symbol1159< 0.1%
 
Initial Punctuation1026< 0.1%
 
Modifier Symbol887< 0.1%
 
Format545< 0.1%
 
Spacing Mark149< 0.1%
 
Modifier Letter132< 0.1%
 
Enclosing Mark128< 0.1%
 
Other Number9< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C471928.3%
 
I428647.6%
 
V358806.3%
 
S339666.0%
 
M311925.5%
 
T300395.3%
 
A300085.3%
 
O299485.3%
 
D280224.9%
 
P267604.7%
 
N247204.4%
 
B234434.1%
 
E184673.3%
 
R180023.2%
 
H166882.9%
 
W153062.7%
 
F149202.6%
 
G148092.6%
 
U139652.5%
 
L124092.2%
 
K113372.0%
 
J111252.0%
 
Z109091.9%
 
Y97331.7%
 
X83411.5%
 
Other values (79)75241.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e39006810.1%
 
t38608510.0%
 
a3235258.4%
 
o3231588.4%
 
i2921417.6%
 
n2745577.1%
 
s2512186.5%
 
c2096185.4%
 
r2080955.4%
 
h1814814.7%
 
d1455073.8%
 
l1123962.9%
 
p1115862.9%
 
u908952.4%
 
f837612.2%
 
v835272.2%
 
m774302.0%
 
y681421.8%
 
g635011.6%
 
w470511.2%
 
b431091.1%
 
k375351.0%
 
x204990.5%
 
z199570.5%
 
j130750.3%
 
Other values (182)96780.3%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
721198> 99.9%
 
 215< 0.1%
 
 7< 0.1%
 
4< 0.1%
 
4< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/12837429.1%
 
#8638919.6%
 
.7332416.6%
 
:4687710.6%
 
324347.4%
 
@232815.3%
 
,197024.5%
 
!90542.1%
 
'78281.8%
 
?39860.9%
 
;29830.7%
 
&26120.6%
 
"20110.5%
 
%14370.3%
 
*3500.1%
 
133< 0.1%
 
75< 0.1%
 
·6< 0.1%
 
5< 0.1%
 
¡5< 0.1%
 
4< 0.1%
 
3< 0.1%
 
¿1< 0.1%
 
1< 0.1%
 
\1< 0.1%
 
Other values (7)7< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
12777420.0%
 
92025414.6%
 
01706112.3%
 
21626711.7%
 
3106747.7%
 
5100447.2%
 
499067.1%
 
892756.7%
 
690306.5%
 
787786.3%
 
𝟯1< 0.1%
 
𝟱1< 0.1%
 
𝟭1< 0.1%
 
𝟵1< 0.1%
 
𝟏1< 0.1%
 
𝟗1< 0.1%
 
𝟷1< 0.1%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_3723100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-936096.5%
 
2192.3%
 
1221.3%
 
2< 0.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(193594.7%
 
[974.7%
 
60.3%
 
{30.1%
 
1< 0.1%
 
1< 0.1%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)172494.0%
 
]884.8%
 
191.0%
 
}20.1%
 
10.1%
 

Most frequent Control characters

ValueCountFrequency (%) 
28216> 99.9%
 
2< 0.1%
 

Most frequent Format characters

ValueCountFrequency (%) 
16329.9%
 
14326.2%
 
13124.0%
 
8716.0%
 
71.3%
 
󠁧20.4%
 
󠁢20.4%
 
󠁳20.4%
 
󠁣20.4%
 
󠁴20.4%
 
󠁿20.4%
 
10.2%
 
10.2%
 

Most frequent Initial Punctuation characters

ValueCountFrequency (%) 
72070.2%
 
29528.8%
 
«111.1%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
💉172713.2%
 
🙏4383.4%
 
🇳3903.0%
 
🇺3192.4%
 
😂3192.4%
 
3052.3%
 
👏2992.3%
 
💪2992.3%
 
👍2762.1%
 
🇨2652.0%
 
👇2521.9%
 
2351.8%
 
🇮2101.6%
 
🇷1781.4%
 
🤣1761.3%
 
🙌1691.3%
 
😷1681.3%
 
🦠1671.3%
 
🇸1571.2%
 
🚀1551.2%
 
🇪1481.1%
 
🇦1471.1%
 
🤔1431.1%
 
😊1381.1%
 
🇬1321.0%
 
Other values (537)582944.7%
 

Most frequent Final Punctuation characters

ValueCountFrequency (%) 
550891.3%
 
5148.5%
 
»100.2%
 
2< 0.1%
 

Most frequent Nonspacing Mark characters

ValueCountFrequency (%) 
113783.2%
 
372.7%
 
292.1%
 
251.8%
 
141.0%
 
100.7%
 
100.7%
 
90.7%
 
80.6%
 
80.6%
 
̇70.5%
 
̶70.5%
 
60.4%
 
60.4%
 
40.3%
 
͟40.3%
 
40.3%
 
30.2%
 
30.2%
 
20.1%
 
20.1%
 
20.1%
 
20.1%
 
20.1%
 
20.1%
 
Other values (17)231.7%
 

Most frequent Modifier Symbol characters

ValueCountFrequency (%) 
🏻31735.7%
 
🏼24327.4%
 
🏽15517.5%
 
🏾11513.0%
 
`202.3%
 
🏿141.6%
 
^131.5%
 
´101.1%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
|62854.2%
 
+32628.1%
 
=968.3%
 
~857.3%
 
70.6%
 
60.5%
 
×30.3%
 
±20.2%
 
20.2%
 
20.2%
 
10.1%
 
10.1%
 

Most frequent Currency Symbol characters

ValueCountFrequency (%) 
$117195.3%
 
211.7%
 
£211.7%
 
161.3%
 

Most frequent Other Letter characters

ValueCountFrequency (%) 
ا1218.0%
 
835.5%
 
ر704.6%
 
ل543.6%
 
453.0%
 
و402.6%
 
ن382.5%
 
ز362.4%
 
342.2%
 
ج342.2%
 
ی251.6%
 
241.6%
 
ي231.5%
 
م211.4%
 
181.2%
 
171.1%
 
ک161.1%
 
ئ151.0%
 
151.0%
 
ب140.9%
 
د140.9%
 
140.9%
 
140.9%
 
130.9%
 
120.8%
 
Other values (312)70946.7%
 

Most frequent Enclosing Mark characters

ValueCountFrequency (%) 
128100.0%
 

Most frequent Spacing Mark characters

ValueCountFrequency (%) 
3221.5%
 
3020.1%
 
ि2416.1%
 
2416.1%
 
85.4%
 
42.7%
 
42.7%
 
32.0%
 
ி32.0%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
21.3%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 
10.7%
 

Most frequent Modifier Letter characters

ValueCountFrequency (%) 
12493.9%
 
ˈ75.3%
 
ˌ10.8%
 

Most frequent Other Number characters

ValueCountFrequency (%) 
222.2%
 
111.1%
 
111.1%
 
111.1%
 
111.1%
 
111.1%
 
²111.1%
 
½111.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin443413076.3%
 
Common137166623.6%
 
Inherited1429< 0.1%
 
Arabic639< 0.1%
 
Devanagari549< 0.1%
 
Thai176< 0.1%
 
Han121< 0.1%
 
Cyrillic110< 0.1%
 
Hangul98< 0.1%
 
Tamil88< 0.1%
 
Greek79< 0.1%
 
Myanmar67< 0.1%
 
Katakana46< 0.1%
 
Sinhala25< 0.1%
 
Telugu23< 0.1%
 
Hiragana20< 0.1%
 
Kannada18< 0.1%
 
Braille5< 0.1%
 
Canadian_Aboriginal1< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e3900688.8%
 
t3860858.7%
 
a3235257.3%
 
o3231587.3%
 
i2921416.6%
 
n2745576.2%
 
s2512185.7%
 
c2096184.7%
 
r2080954.7%
 
h1814814.1%
 
d1455073.3%
 
l1123962.5%
 
p1115862.5%
 
u908952.0%
 
f837611.9%
 
v835271.9%
 
m774301.7%
 
y681421.5%
 
g635011.4%
 
C471921.1%
 
w470511.1%
 
b431091.0%
 
I428641.0%
 
k375350.8%
 
V358800.8%
 
Other values (80)50380811.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
72119852.6%
 
/1283749.4%
 
#863896.3%
 
.733245.3%
 
:468773.4%
 
324342.4%
 
282162.1%
 
1277742.0%
 
@232811.7%
 
9202541.5%
 
,197021.4%
 
0170611.2%
 
2162671.2%
 
3106740.8%
 
5100440.7%
 
499060.7%
 
-93600.7%
 
892750.7%
 
!90540.7%
 
690300.7%
 
787780.6%
 
'78280.6%
 
55080.4%
 
?39860.3%
 
_37230.3%
 
Other values (811)333492.4%
 

Most frequent Inherited characters

ValueCountFrequency (%) 
113779.6%
 
14310.0%
 
1289.0%
 
̇70.5%
 
̶70.5%
 
͟40.3%
 
ُ10.1%
 
10.1%
 
10.1%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا12118.9%
 
ر7011.0%
 
ل548.5%
 
و406.3%
 
ن385.9%
 
ز365.6%
 
ج345.3%
 
ی253.9%
 
ي233.6%
 
م213.3%
 
ک162.5%
 
ئ152.3%
 
ب142.2%
 
د142.2%
 
ت101.6%
 
ع91.4%
 
خ81.3%
 
ہ81.3%
 
س71.1%
 
ك71.1%
 
ط71.1%
 
ة60.9%
 
ے60.9%
 
ح50.8%
 
ص50.8%
 
Other values (15)406.3%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
8315.1%
 
458.2%
 
376.7%
 
346.2%
 
325.8%
 
305.5%
 
295.3%
 
254.6%
 
ि244.4%
 
244.4%
 
244.4%
 
183.3%
 
142.6%
 
142.6%
 
122.2%
 
122.2%
 
101.8%
 
91.6%
 
81.5%
 
81.5%
 
81.5%
 
61.1%
 
61.1%
 
50.9%
 
40.7%
 
Other values (15)285.1%
 

Most frequent Braille characters

ValueCountFrequency (%) 
5100.0%
 

Most frequent Han characters

ValueCountFrequency (%) 
43.3%
 
43.3%
 
43.3%
 
43.3%
 
32.5%
 
32.5%
 
32.5%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
Other values (53)6049.6%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
о1513.6%
 
а87.3%
 
с87.3%
 
в76.4%
 
к65.5%
 
и65.5%
 
д65.5%
 
н65.5%
 
л54.5%
 
р43.6%
 
у43.6%
 
т43.6%
 
е43.6%
 
П32.7%
 
м32.7%
 
Р21.8%
 
ж21.8%
 
ч21.8%
 
б21.8%
 
В10.9%
 
і10.9%
 
С10.9%
 
Я10.9%
 
я10.9%
 
г10.9%
 
Other values (7)76.4%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
99.2%
 
55.1%
 
55.1%
 
55.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
Other values (31)3131.6%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
1415.9%
 
89.1%
 
89.1%
 
66.8%
 
44.5%
 
44.5%
 
44.5%
 
33.4%
 
33.4%
 
33.4%
 
33.4%
 
ி33.4%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
11.1%
 
11.1%
 
11.1%
 
Other values (2)22.3%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
48.7%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
24.3%
 
24.3%
 
24.3%
 
24.3%
 
24.3%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 

Most frequent Sinhala characters

ValueCountFrequency (%) 
416.0%
 
312.0%
 
312.0%
 
28.0%
 
28.0%
 
28.0%
 
28.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 

Most frequent Greek characters

ValueCountFrequency (%) 
ο1417.7%
 
μ1012.7%
 
ι67.6%
 
α56.3%
 
ς45.1%
 
ε45.1%
 
β45.1%
 
λ45.1%
 
σ45.1%
 
π22.5%
 
κ22.5%
 
ρ22.5%
 
ν22.5%
 
Δ22.5%
 
Μ22.5%
 
Ε22.5%
 
υ22.5%
 
τ22.5%
 
ω11.3%
 
Θ11.3%
 
έ11.3%
 
Σ11.3%
 
ί11.3%
 
Ν11.3%
 

Most frequent Thai characters

ValueCountFrequency (%) 
179.7%
 
158.5%
 
137.4%
 
105.7%
 
105.7%
 
95.1%
 
95.1%
 
84.5%
 
84.5%
 
52.8%
 
52.8%
 
52.8%
 
52.8%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
Other values (15)2011.4%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
811.9%
 
69.0%
 
69.0%
 
57.5%
 
က57.5%
 
46.0%
 
46.0%
 
34.5%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
210.0%
 
210.0%
 
210.0%
 
210.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
417.4%
 
313.0%
 
ి28.7%
 
28.7%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
316.7%
 
ಿ211.1%
 
211.1%
 
211.1%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 

Most frequent Canadian_Aboriginal characters

ValueCountFrequency (%) 
1100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII574928299.0%
 
Punctuation405780.7%
 
None75900.1%
 
Enclosed Alphanum Sup2783< 0.1%
 
Emoticons2314< 0.1%
 
VS1138< 0.1%
 
Dingbats920< 0.1%
 
Math Alphanum856< 0.1%
 
Arabic641< 0.1%
 
Latin 1 Sup625< 0.1%
 
Devanagari552< 0.1%
 
Misc Symbols396< 0.1%
 
Phonetic Ext319< 0.1%
 
Thai176< 0.1%
 
Katakana174< 0.1%
 
IPA Ext135< 0.1%
 
CJK121< 0.1%
 
Cyrillic110< 0.1%
 
Hangul98< 0.1%
 
Tamil88< 0.1%
 
Myanmar67< 0.1%
 
Latin Ext A60< 0.1%
 
Currency Symbols37< 0.1%
 
Geometric Shapes34< 0.1%
 
Sinhala25< 0.1%
 
Other values (20)171< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
72119812.5%
 
e3900686.8%
 
t3860856.7%
 
a3235255.6%
 
o3231585.6%
 
i2921415.1%
 
n2745574.8%
 
s2512184.4%
 
c2096183.6%
 
r2080953.6%
 
h1814813.2%
 
d1455072.5%
 
/1283742.2%
 
l1123962.0%
 
p1115861.9%
 
u908951.6%
 
#863891.5%
 
f837611.5%
 
v835271.5%
 
m774301.3%
 
.733241.3%
 
y681421.2%
 
g635011.1%
 
C471920.8%
 
w470510.8%
 
Other values (70)96906316.9%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
3243479.9%
 
550813.6%
 
7201.8%
 
5141.3%
 
2950.7%
 
2190.5%
 
1630.4%
 
1430.4%
 
1330.3%
 
1310.3%
 
1220.3%
 
870.2%
 
750.2%
 
7< 0.1%
 
6< 0.1%
 
5< 0.1%
 
4< 0.1%
 
4< 0.1%
 
2< 0.1%
 
2< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 
1< 0.1%
 

Most frequent None characters

ValueCountFrequency (%) 
💉172722.8%
 
🏻3174.2%
 
👏2993.9%
 
💪2993.9%
 
👍2763.6%
 
👇2523.3%
 
🏼2433.2%
 
🤣1762.3%
 
🦠1672.2%
 
🏽1552.0%
 
🚀1552.0%
 
🤔1431.9%
 
1281.7%
 
🥳1221.6%
 
🏾1151.5%
 
🎉1111.5%
 
🌎901.2%
 
👉801.1%
 
🥰710.9%
 
🔥690.9%
 
💙590.8%
 
👌550.7%
 
🚨540.7%
 
🤷500.7%
 
💥490.6%
 
Other values (389)232830.7%
 

Most frequent Misc Symbols characters

ValueCountFrequency (%) 
5814.6%
 
5012.6%
 
4711.9%
 
317.8%
 
307.6%
 
266.6%
 
256.3%
 
225.6%
 
225.6%
 
215.3%
 
92.3%
 
71.8%
 
51.3%
 
51.3%
 
41.0%
 
41.0%
 
30.8%
 
30.8%
 
30.8%
 
20.5%
 
20.5%
 
20.5%
 
10.3%
 
10.3%
 
10.3%
 
Other values (12)123.0%
 

Most frequent VS characters

ValueCountFrequency (%) 
113799.9%
 
10.1%
 

Most frequent Emoticons characters

ValueCountFrequency (%) 
🙏43818.9%
 
😂31913.8%
 
🙌1697.3%
 
😷1687.3%
 
😊1386.0%
 
😁1275.5%
 
😎1144.9%
 
😭753.2%
 
😍492.1%
 
😉451.9%
 
😅411.8%
 
😳411.8%
 
😃391.7%
 
😜391.7%
 
😀371.6%
 
🙂371.6%
 
🙄311.3%
 
😆271.2%
 
😱241.0%
 
😇231.0%
 
😩221.0%
 
😢221.0%
 
😌180.8%
 
😬170.7%
 
😡160.7%
 
Other values (44)23810.3%
 

Most frequent Enclosed Alphanum Sup characters

ValueCountFrequency (%) 
🇳39014.0%
 
🇺31911.5%
 
🇨2659.5%
 
🇮2107.5%
 
🇷1786.4%
 
🇸1575.6%
 
🇪1485.3%
 
🇦1475.3%
 
🇬1324.7%
 
🇧1224.4%
 
🇵1053.8%
 
🇭953.4%
 
🇰903.2%
 
🇹762.7%
 
🇲642.3%
 
🇱501.8%
 
🇿431.5%
 
🇩411.5%
 
🇾341.2%
 
🇴260.9%
 
🇽170.6%
 
🇼140.5%
 
🇫130.5%
 
🇶130.5%
 
🇻130.5%
 
Other values (6)210.8%
 

Most frequent Currency Symbols characters

ValueCountFrequency (%) 
2156.8%
 
1643.2%
 

Most frequent Dingbats characters

ValueCountFrequency (%) 
30533.2%
 
23525.5%
 
11112.1%
 
889.6%
 
434.7%
 
363.9%
 
272.9%
 
151.6%
 
121.3%
 
80.9%
 
50.5%
 
50.5%
 
40.4%
 
40.4%
 
30.3%
 
30.3%
 
30.3%
 
30.3%
 
20.2%
 
20.2%
 
10.1%
 
10.1%
 
10.1%
 
10.1%
 
10.1%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا12118.9%
 
ر7010.9%
 
ل548.4%
 
و406.2%
 
ن385.9%
 
ز365.6%
 
ج345.3%
 
ی253.9%
 
ي233.6%
 
م213.3%
 
ک162.5%
 
ئ152.3%
 
ب142.2%
 
د142.2%
 
ت101.6%
 
ع91.4%
 
خ81.2%
 
ہ81.2%
 
س71.1%
 
ك71.1%
 
ط71.1%
 
ة60.9%
 
ے60.9%
 
ح50.8%
 
ص50.8%
 
Other values (17)426.6%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
 21534.4%
 
Ê9014.4%
 
í487.7%
 
é335.3%
 
ó274.3%
 
°243.8%
 
á233.7%
 
£213.4%
 
®172.7%
 
ü152.4%
 
«111.8%
 
»101.6%
 
ñ101.6%
 
´101.6%
 
º71.1%
 
·61.0%
 
ö50.8%
 
¡50.8%
 
ú40.6%
 
©40.6%
 
ã40.6%
 
è30.5%
 
ô30.5%
 
Ö30.5%
 
ä30.5%
 
Other values (16)243.8%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
8315.0%
 
458.2%
 
376.7%
 
346.2%
 
325.8%
 
305.4%
 
295.3%
 
254.5%
 
ि244.3%
 
244.3%
 
244.3%
 
183.3%
 
142.5%
 
142.5%
 
122.2%
 
122.2%
 
101.8%
 
91.6%
 
81.4%
 
81.4%
 
81.4%
 
61.1%
 
61.1%
 
50.9%
 
40.7%
 
Other values (16)315.6%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
12471.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
21.1%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
Other values (3)31.7%
 

Most frequent Geometric Shapes characters

ValueCountFrequency (%) 
1235.3%
 
823.5%
 
617.6%
 
38.8%
 
25.9%
 
25.9%
 
12.9%
 

Most frequent Latin Ext A characters

ValueCountFrequency (%) 
ğ1118.3%
 
š915.0%
 
Ş813.3%
 
č813.3%
 
ı711.7%
 
ć610.0%
 
ş46.7%
 
İ35.0%
 
ď11.7%
 
ā11.7%
 
ē11.7%
 
ą11.7%
 

Most frequent Letterlike Symbols characters

ValueCountFrequency (%) 
233.3%
 
233.3%
 
116.7%
 
116.7%
 

Most frequent Diacriticals characters

ValueCountFrequency (%) 
̇738.9%
 
̶738.9%
 
͟422.2%
 

Most frequent Arrows characters

ValueCountFrequency (%) 
675.0%
 
112.5%
 
112.5%
 

Most frequent Braille characters

ValueCountFrequency (%) 
5100.0%
 

Most frequent Sup Arrows B characters

ValueCountFrequency (%) 
777.8%
 
222.2%
 

Most frequent Number Forms characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Geometric Shapes Ext characters

ValueCountFrequency (%) 
🟢457.1%
 
🟣114.3%
 
🟠114.3%
 
🟩114.3%
 

Most frequent CJK characters

ValueCountFrequency (%) 
43.3%
 
43.3%
 
43.3%
 
43.3%
 
32.5%
 
32.5%
 
32.5%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
21.7%
 
Other values (53)6049.6%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
о1513.6%
 
а87.3%
 
с87.3%
 
в76.4%
 
к65.5%
 
и65.5%
 
д65.5%
 
н65.5%
 
л54.5%
 
р43.6%
 
у43.6%
 
т43.6%
 
е43.6%
 
П32.7%
 
м32.7%
 
Р21.8%
 
ж21.8%
 
ч21.8%
 
б21.8%
 
В10.9%
 
і10.9%
 
С10.9%
 
Я10.9%
 
я10.9%
 
г10.9%
 
Other values (7)76.4%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
99.2%
 
55.1%
 
55.1%
 
55.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
33.1%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 
Other values (31)3131.6%
 

Most frequent Phonetic Ext characters

ValueCountFrequency (%) 
8727.3%
 
8627.0%
 
6520.4%
 
6520.4%
 
61.9%
 
51.6%
 
30.9%
 
20.6%
 

Most frequent IPA Ext characters

ValueCountFrequency (%) 
ɪ6447.4%
 
ɴ3324.4%
 
ʟ2720.0%
 
ə43.0%
 
ɡ21.5%
 
ʌ21.5%
 
ʘ21.5%
 
ɢ10.7%
 

Most frequent Math Alphanum characters

ValueCountFrequency (%) 
𝙖374.3%
 
𝙣323.7%
 
𝙮252.9%
 
𝗮242.8%
 
𝗼232.7%
 
𝗲222.6%
 
𝙚222.6%
 
𝙞222.6%
 
𝙤212.5%
 
𝙙212.5%
 
𝙧192.2%
 
𝗶182.1%
 
𝗻182.1%
 
𝘀172.0%
 
𝙢161.9%
 
𝙏161.9%
 
𝘁151.8%
 
𝗿151.8%
 
𝐢141.6%
 
𝗰131.5%
 
𝙈121.4%
 
𝐞121.4%
 
𝐚121.4%
 
𝗜111.3%
 
𝗔111.3%
 
Other values (131)38845.3%
 

Most frequent Misc Technical characters

ValueCountFrequency (%) 
1275.0%
 
212.5%
 
16.2%
 
16.2%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
1415.9%
 
89.1%
 
89.1%
 
66.8%
 
44.5%
 
44.5%
 
44.5%
 
33.4%
 
33.4%
 
33.4%
 
33.4%
 
ி33.4%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
22.3%
 
11.1%
 
11.1%
 
11.1%
 
Other values (2)22.3%
 

Most frequent Sinhala characters

ValueCountFrequency (%) 
416.0%
 
312.0%
 
312.0%
 
28.0%
 
28.0%
 
28.0%
 
28.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 
14.0%
 

Most frequent Thai characters

ValueCountFrequency (%) 
179.7%
 
158.5%
 
137.4%
 
105.7%
 
105.7%
 
95.1%
 
95.1%
 
84.5%
 
84.5%
 
52.8%
 
52.8%
 
52.8%
 
52.8%
 
42.3%
 
42.3%
 
42.3%
 
42.3%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
31.7%
 
21.1%
 
21.1%
 
21.1%
 
Other values (15)2011.4%
 

Most frequent Tags characters

ValueCountFrequency (%) 
󠁧216.7%
 
󠁢216.7%
 
󠁳216.7%
 
󠁣216.7%
 
󠁴216.7%
 
󠁿216.7%
 

Most frequent Math Operators characters

ValueCountFrequency (%) 
266.7%
 
133.3%
 

Most frequent Modifier Letters characters

ValueCountFrequency (%) 
ˈ787.5%
 
ˌ112.5%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
811.9%
 
69.0%
 
69.0%
 
57.5%
 
က57.5%
 
46.0%
 
46.0%
 
34.5%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
23.0%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 
11.5%
 

Most frequent Enclosed Alphanum characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
210.0%
 
210.0%
 
210.0%
 
210.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 

Most frequent Block Elements characters

ValueCountFrequency (%) 
650.0%
 
650.0%
 

Most frequent Telugu characters

ValueCountFrequency (%) 
417.4%
 
313.0%
 
ి28.7%
 
28.7%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 
14.3%
 

Most frequent Kannada characters

ValueCountFrequency (%) 
316.7%
 
ಿ211.1%
 
211.1%
 
211.1%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 
15.6%
 

Most frequent Arabic PF B characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Box Drawing characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent UCAS characters

ValueCountFrequency (%) 
1100.0%
 

Most frequent Alphabetic PF characters

ValueCountFrequency (%) 
1100.0%
 

hashtags
Categorical

HIGH CARDINALITY
MISSING

Distinct16835
Distinct (%)46.5%
Missing9816
Missing (%)21.3%
Memory size360.0 KiB
['Moderna']
 
2160
['Covaxin']
 
1705
['SputnikV']
 
1647
['PfizerBioNTech']
 
853
['OxfordAstraZeneca']
 
606
Other values (16830)
29272 
ValueCountFrequency (%) 
['Moderna']21604.7%
 
['Covaxin']17053.7%
 
['SputnikV']16473.6%
 
['PfizerBioNTech']8531.9%
 
['OxfordAstraZeneca']6061.3%
 
['COVID19']5861.3%
 
['moderna']5211.1%
 
['Sinopharm']4010.9%
 
['Sinovac']4010.9%
 
['COVAXIN']3630.8%
 
['covaxin']2460.5%
 
['oxfordastrazeneca']2300.5%
 
['Pfizer', 'Moderna']1870.4%
 
['PfizerBiontech']1660.4%
 
['vaccine']1450.3%
 
['Moderna', 'vaccine']1420.3%
 
['CovidVaccine']1280.3%
 
['AstraZeneca']1150.2%
 
['Moderna', 'CovidVaccine']1050.2%
 
['Covishield', 'Covaxin']1030.2%
 
['Moderna', 'COVID19']1030.2%
 
['CovidVaccine', 'Moderna']910.2%
 
['PfizerBioNTech', 'CovidVaccine']870.2%
 
['Pfizer']810.2%
 
['COVID19', 'vaccine']790.2%
 
Other values (16810)2499254.3%
 
(Missing)981621.3%
 
2021-05-15T20:26:58.048204image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique14702 ?
Unique (%)40.6%
2021-05-15T20:26:58.585051image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length142
Median length21
Mean length24.66050066
Min length3

Overview of Unicode Properties

Unique unicode characters442
Unique unicode categories12 ?
Unique unicode scripts14 ?
Unique unicode blocks17 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
'17163615.1%
 
n814797.2%
 
a782876.9%
 
i662365.8%
 
e601385.3%
 
o509484.5%
 
,495754.4%
 
495754.4%
 
c479064.2%
 
r402133.5%
 
[362433.2%
 
]362433.2%
 
d259662.3%
 
t237522.1%
 
v235972.1%
 
C214211.9%
 
s212211.9%
 
V204461.8%
 
h152031.3%
 
u122151.1%
 
S114691.0%
 
f112911.0%
 
I112791.0%
 
O112701.0%
 
M103230.9%
 
Other values (417)14790613.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter62291754.8%
 
Other Punctuation22121119.5%
 
Uppercase Letter14970013.2%
 
Space Separator495754.4%
 
Open Punctuation362433.2%
 
Close Punctuation362433.2%
 
Decimal Number179161.6%
 
Other Letter11140.1%
 
Connector Punctuation6460.1%
 
Modifier Letter123< 0.1%
 
Nonspacing Mark98< 0.1%
 
Spacing Mark52< 0.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
[36243100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
'17163677.6%
 
,4957522.4%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C2142114.3%
 
V2044613.7%
 
S114697.7%
 
I112797.5%
 
O112707.5%
 
M103236.9%
 
D93776.3%
 
P89296.0%
 
N71624.8%
 
A70974.7%
 
B68674.6%
 
T53963.6%
 
Z29792.0%
 
R22201.5%
 
E21731.5%
 
U17941.2%
 
H15991.1%
 
G14461.0%
 
F12080.8%
 
K11630.8%
 
L9980.7%
 
X9460.6%
 
W9070.6%
 
J8240.6%
 
Y2980.2%
 
Other values (15)1090.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n8147913.1%
 
a7828712.6%
 
i6623610.6%
 
e601389.7%
 
o509488.2%
 
c479067.7%
 
r402136.5%
 
d259664.2%
 
t237523.8%
 
v235973.8%
 
s212213.4%
 
h152032.4%
 
u122152.0%
 
f112911.8%
 
p101881.6%
 
l85891.4%
 
m84581.4%
 
z83441.3%
 
x77871.3%
 
k74771.2%
 
g40680.7%
 
y36490.6%
 
b24970.4%
 
w19310.3%
 
j6750.1%
 
Other values (76)8020.1%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
]36243100.0%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
49575100.0%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_646100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
1847547.3%
 
9803244.8%
 
26893.8%
 
02901.6%
 
41170.7%
 
5880.5%
 
3760.4%
 
7670.4%
 
8500.3%
 
6320.2%
 

Most frequent Other Letter characters

ValueCountFrequency (%) 
ا11510.3%
 
ر665.9%
 
ل524.7%
 
433.9%
 
و373.3%
 
ز353.1%
 
ن333.0%
 
ج322.9%
 
ي232.1%
 
211.9%
 
م201.8%
 
ی191.7%
 
161.4%
 
ئ151.3%
 
ک131.2%
 
ب131.2%
 
د131.2%
 
131.2%
 
121.1%
 
121.1%
 
111.0%
 
90.8%
 
ت80.7%
 
80.7%
 
ع80.7%
 
Other values (230)46741.9%
 

Most frequent Modifier Letter characters

ValueCountFrequency (%) 
123100.0%
 

Most frequent Nonspacing Mark characters

ValueCountFrequency (%) 
1515.3%
 
1212.2%
 
1010.2%
 
99.2%
 
99.2%
 
99.2%
 
77.1%
 
̇55.1%
 
44.1%
 
33.1%
 
33.1%
 
22.0%
 
22.0%
 
22.0%
 
22.0%
 
11.0%
 
11.0%
 
11.0%
 
11.0%
 

Most frequent Spacing Mark characters

ValueCountFrequency (%) 
1223.1%
 
ि1223.1%
 
1121.2%
 
917.3%
 
23.8%
 
23.8%
 
11.9%
 
11.9%
 
11.9%
 
11.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin77243668.0%
 
Common36195731.9%
 
Arabic5850.1%
 
Devanagari227< 0.1%
 
Thai145< 0.1%
 
Cyrillic106< 0.1%
 
Han104< 0.1%
 
Hangul95< 0.1%
 
Greek75< 0.1%
 
Katakana46< 0.1%
 
Myanmar31< 0.1%
 
Hiragana20< 0.1%
 
Tamil6< 0.1%
 
Inherited5< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
'17163647.4%
 
,4957513.7%
 
4957513.7%
 
[3624310.0%
 
]3624310.0%
 
184752.3%
 
980322.2%
 
26890.2%
 
_6460.2%
 
02900.1%
 
123< 0.1%
 
4117< 0.1%
 
588< 0.1%
 
376< 0.1%
 
767< 0.1%
 
850< 0.1%
 
632< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n8147910.5%
 
a7828710.1%
 
i662368.6%
 
e601387.8%
 
o509486.6%
 
c479066.2%
 
r402135.2%
 
d259663.4%
 
t237523.1%
 
v235973.1%
 
C214212.8%
 
s212212.7%
 
V204462.6%
 
h152032.0%
 
u122151.6%
 
S114691.5%
 
f112911.5%
 
I112791.5%
 
O112701.5%
 
M103231.3%
 
p101881.3%
 
D93771.2%
 
P89291.2%
 
l85891.1%
 
m84581.1%
 
Other values (62)8223510.6%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا11519.7%
 
ر6611.3%
 
ل528.9%
 
و376.3%
 
ز356.0%
 
ن335.6%
 
ج325.5%
 
ي233.9%
 
م203.4%
 
ی193.2%
 
ئ152.6%
 
ک132.2%
 
ب132.2%
 
د132.2%
 
ت81.4%
 
ع81.4%
 
خ71.2%
 
ك71.2%
 
ط71.2%
 
ے61.0%
 
س50.9%
 
ح50.9%
 
ة50.9%
 
ش40.7%
 
چ40.7%
 
Other values (11)335.6%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
о1514.2%
 
а87.5%
 
в76.6%
 
с76.6%
 
к65.7%
 
и65.7%
 
д65.7%
 
н65.7%
 
л54.7%
 
р43.8%
 
у43.8%
 
т43.8%
 
е43.8%
 
м32.8%
 
П21.9%
 
ж21.9%
 
ч21.9%
 
б21.9%
 
і10.9%
 
С10.9%
 
Я10.9%
 
Р10.9%
 
я10.9%
 
г10.9%
 
п10.9%
 
Other values (6)65.7%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
99.5%
 
55.3%
 
55.3%
 
55.3%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
Other values (28)2829.5%
 

Most frequent Han characters

ValueCountFrequency (%) 
43.8%
 
43.8%
 
43.8%
 
43.8%
 
32.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
Other values (42)4543.3%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
48.7%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
36.5%
 
24.3%
 
24.3%
 
24.3%
 
24.3%
 
24.3%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 
12.2%
 

Most frequent Thai characters

ValueCountFrequency (%) 
1611.0%
 
139.0%
 
128.3%
 
96.2%
 
96.2%
 
96.2%
 
74.8%
 
74.8%
 
64.1%
 
53.4%
 
42.8%
 
42.8%
 
42.8%
 
32.1%
 
32.1%
 
32.1%
 
32.1%
 
32.1%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
Other values (10)117.6%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
4318.9%
 
219.3%
 
156.6%
 
125.3%
 
125.3%
 
125.3%
 
ि125.3%
 
114.8%
 
114.8%
 
104.4%
 
94.0%
 
83.5%
 
73.1%
 
62.6%
 
62.6%
 
62.6%
 
52.2%
 
52.2%
 
52.2%
 
52.2%
 
20.9%
 
10.4%
 
10.4%
 
10.4%
 
10.4%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
412.9%
 
39.7%
 
က39.7%
 
26.5%
 
26.5%
 
26.5%
 
26.5%
 
26.5%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 

Most frequent Greek characters

ValueCountFrequency (%) 
ο1317.3%
 
μ912.0%
 
ι68.0%
 
α56.7%
 
ς45.3%
 
ε45.3%
 
β45.3%
 
λ45.3%
 
σ45.3%
 
κ22.7%
 
ρ22.7%
 
ν22.7%
 
Δ22.7%
 
Μ22.7%
 
Ε22.7%
 
υ22.7%
 
τ22.7%
 
ω11.3%
 
Θ11.3%
 
έ11.3%
 
Σ11.3%
 
π11.3%
 
ί11.3%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
210.0%
 
210.0%
 
210.0%
 
210.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 

Most frequent Inherited characters

ValueCountFrequency (%) 
̇5100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII113375399.8%
 
Arabic5850.1%
 
Phonetic Ext318< 0.1%
 
Devanagari227< 0.1%
 
Katakana169< 0.1%
 
Thai145< 0.1%
 
IPA Ext125< 0.1%
 
Cyrillic106< 0.1%
 
CJK104< 0.1%
 
Hangul95< 0.1%
 
None75< 0.1%
 
Latin 1 Sup60< 0.1%
 
Myanmar31< 0.1%
 
Hiragana20< 0.1%
 
Latin Ext A14< 0.1%
 
Tamil6< 0.1%
 
Diacriticals5< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
'17163615.1%
 
n814797.2%
 
a782876.9%
 
i662365.8%
 
e601385.3%
 
o509484.5%
 
,495754.4%
 
495754.4%
 
c479064.2%
 
r402133.5%
 
[362433.2%
 
]362433.2%
 
d259662.3%
 
t237522.1%
 
v235972.1%
 
C214211.9%
 
s212211.9%
 
V204461.8%
 
h152031.3%
 
u122151.1%
 
S114691.0%
 
f112911.0%
 
I112791.0%
 
O112701.0%
 
M103230.9%
 
Other values (43)14582112.9%
 

Most frequent Arabic characters

ValueCountFrequency (%) 
ا11519.7%
 
ر6611.3%
 
ل528.9%
 
و376.3%
 
ز356.0%
 
ن335.6%
 
ج325.5%
 
ي233.9%
 
م203.4%
 
ی193.2%
 
ئ152.6%
 
ک132.2%
 
ب132.2%
 
د132.2%
 
ت81.4%
 
ع81.4%
 
خ71.2%
 
ك71.2%
 
ط71.2%
 
ے61.0%
 
س50.9%
 
ح50.9%
 
ة50.9%
 
ش40.7%
 
چ40.7%
 
Other values (11)335.6%
 

Most frequent Katakana characters

ValueCountFrequency (%) 
12372.8%
 
42.4%
 
31.8%
 
31.8%
 
31.8%
 
31.8%
 
31.8%
 
31.8%
 
21.2%
 
21.2%
 
21.2%
 
21.2%
 
21.2%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
10.6%
 
Other values (2)21.2%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
ó1220.0%
 
ü813.3%
 
é813.3%
 
á813.3%
 
í610.0%
 
ö35.0%
 
Í23.3%
 
ñ23.3%
 
Ç23.3%
 
ä23.3%
 
Ó11.7%
 
è11.7%
 
ú11.7%
 
ý11.7%
 
à11.7%
 
ï11.7%
 
ç11.7%
 

Most frequent Cyrillic characters

ValueCountFrequency (%) 
о1514.2%
 
а87.5%
 
в76.6%
 
с76.6%
 
к65.7%
 
и65.7%
 
д65.7%
 
н65.7%
 
л54.7%
 
р43.8%
 
у43.8%
 
т43.8%
 
е43.8%
 
м32.8%
 
П21.9%
 
ж21.9%
 
ч21.9%
 
б21.9%
 
і10.9%
 
С10.9%
 
Я10.9%
 
Р10.9%
 
я10.9%
 
г10.9%
 
п10.9%
 
Other values (6)65.7%
 

Most frequent Hangul characters

ValueCountFrequency (%) 
99.5%
 
55.3%
 
55.3%
 
55.3%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
33.2%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
22.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
11.1%
 
Other values (28)2829.5%
 

Most frequent Phonetic Ext characters

ValueCountFrequency (%) 
8727.4%
 
8627.0%
 
6520.4%
 
6420.1%
 
61.9%
 
51.6%
 
30.9%
 
20.6%
 

Most frequent IPA Ext characters

ValueCountFrequency (%) 
ɪ6451.2%
 
ɴ3326.4%
 
ʟ2721.6%
 
ɢ10.8%
 

Most frequent CJK characters

ValueCountFrequency (%) 
43.8%
 
43.8%
 
43.8%
 
43.8%
 
32.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
21.9%
 
Other values (42)4543.3%
 

Most frequent Thai characters

ValueCountFrequency (%) 
1611.0%
 
139.0%
 
128.3%
 
96.2%
 
96.2%
 
96.2%
 
74.8%
 
74.8%
 
64.1%
 
53.4%
 
42.8%
 
42.8%
 
42.8%
 
32.1%
 
32.1%
 
32.1%
 
32.1%
 
32.1%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
21.4%
 
Other values (10)117.6%
 

Most frequent Latin Ext A characters

ValueCountFrequency (%) 
ı428.6%
 
č321.4%
 
ş214.3%
 
İ214.3%
 
ğ214.3%
 
ć17.1%
 

Most frequent Devanagari characters

ValueCountFrequency (%) 
4318.9%
 
219.3%
 
156.6%
 
125.3%
 
125.3%
 
125.3%
 
ि125.3%
 
114.8%
 
114.8%
 
104.4%
 
94.0%
 
83.5%
 
73.1%
 
62.6%
 
62.6%
 
62.6%
 
52.2%
 
52.2%
 
52.2%
 
52.2%
 
20.9%
 
10.4%
 
10.4%
 
10.4%
 
10.4%
 

Most frequent Tamil characters

ValueCountFrequency (%) 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 
116.7%
 

Most frequent Myanmar characters

ValueCountFrequency (%) 
412.9%
 
39.7%
 
က39.7%
 
26.5%
 
26.5%
 
26.5%
 
26.5%
 
26.5%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 
13.2%
 

Most frequent None characters

ValueCountFrequency (%) 
ο1317.3%
 
μ912.0%
 
ι68.0%
 
α56.7%
 
ς45.3%
 
ε45.3%
 
β45.3%
 
λ45.3%
 
σ45.3%
 
κ22.7%
 
ρ22.7%
 
ν22.7%
 
Δ22.7%
 
Μ22.7%
 
Ε22.7%
 
υ22.7%
 
τ22.7%
 
ω11.3%
 
Θ11.3%
 
έ11.3%
 
Σ11.3%
 
π11.3%
 
ί11.3%
 

Most frequent Hiragana characters

ValueCountFrequency (%) 
210.0%
 
210.0%
 
210.0%
 
210.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 
15.0%
 

Most frequent Diacriticals characters

ValueCountFrequency (%) 
̇5100.0%
 

source
Categorical

HIGH CARDINALITY

Distinct171
Distinct (%)0.4%
Missing42
Missing (%)0.1%
Memory size360.0 KiB
Twitter Web App
14538 
Twitter for iPhone
13516 
Twitter for Android
11763 
TweetDeck
2157 
Twitter for iPad
 
995
Other values (166)
3048 
ValueCountFrequency (%) 
Twitter Web App1453831.6%
 
Twitter for iPhone1351629.3%
 
Twitter for Android1176325.5%
 
TweetDeck21574.7%
 
Twitter for iPad9952.2%
 
Instagram7451.6%
 
Hootsuite Inc.4821.0%
 
Buffer1990.4%
 
Twitter Media Studio1750.4%
 
IFTTT1130.2%
 
Etus Brasil900.2%
 
WordPress.com850.2%
 
Hocalwire Social Share800.2%
 
Sprout Social790.2%
 
Twitter Media Studio - LiveCut500.1%
 
LinkedIn480.1%
 
Blog2Social APP480.1%
 
Twitter for Mac420.1%
 
Tickeron350.1%
 
Tweetbot for iΟS320.1%
 
SocialFlow320.1%
 
dlvr.it310.1%
 
IndiaPost270.1%
 
Smarp.260.1%
 
Flying Eze250.1%
 
Other values (146)6041.3%
 
(Missing)420.1%
 
2021-05-15T20:26:59.236912image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Frequencies of value counts

Unique

Unique53 ?
Unique (%)0.1%
2021-05-15T20:26:59.603912image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length32
Median length18
Mean length16.43685273
Min length3

Overview of Unicode Properties

Unique unicode characters70
Unique unicode categories10 ?
Unique unicode scripts3 ?
Unique unicode blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
t8698411.5%
 
8364311.0%
 
r8126010.7%
 
e7770310.3%
 
i695309.2%
 
o541567.2%
 
T438385.8%
 
w435515.8%
 
p293283.9%
 
n271133.6%
 
f268583.5%
 
A264533.5%
 
d253883.4%
 
P148302.0%
 
b147261.9%
 
W146471.9%
 
h137381.8%
 
a37610.5%
 
c33730.4%
 
k23440.3%
 
D21880.3%
 
s19130.3%
 
I14390.2%
 
u12670.2%
 
m9560.1%
 
Other values (45)60780.8%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter56604274.8%
 
Uppercase Letter10646214.1%
 
Space Separator8364911.0%
 
Other Punctuation6970.1%
 
Decimal Number89< 0.1%
 
Dash Punctuation86< 0.1%
 
Connector Punctuation20< 0.1%
 
Open Punctuation7< 0.1%
 
Close Punctuation7< 0.1%
 
Other Symbol6< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
T4383841.2%
 
A2645324.8%
 
P1483013.9%
 
W1464713.8%
 
D21882.1%
 
I14391.4%
 
S9330.9%
 
H5980.6%
 
B3580.3%
 
M3280.3%
 
F1960.2%
 
E1710.2%
 
L1160.1%
 
C970.1%
 
N780.1%
 
O34< 0.1%
 
Ο32< 0.1%
 
R31< 0.1%
 
V21< 0.1%
 
Z19< 0.1%
 
G18< 0.1%
 
U13< 0.1%
 
K12< 0.1%
 
X7< 0.1%
 
J3< 0.1%
 
Other values (2)2< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
t8698415.4%
 
r8126014.4%
 
e7770313.7%
 
i6953012.3%
 
o541569.6%
 
w435517.7%
 
p293285.2%
 
n271134.8%
 
f268584.7%
 
d253884.5%
 
b147262.6%
 
h137382.4%
 
a37610.7%
 
c33730.6%
 
k23440.4%
 
s19130.3%
 
u12670.2%
 
m9560.2%
 
l9430.2%
 
g9190.2%
 
v128< 0.1%
 
y54< 0.1%
 
z27< 0.1%
 
x13< 0.1%
 
q8< 0.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
83643> 99.9%
 
 6< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.69099.0%
 
,40.6%
 
:30.4%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
26977.5%
 
41314.6%
 
122.2%
 
022.2%
 
611.1%
 
711.1%
 
511.1%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-86100.0%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_20100.0%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(7100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)7100.0%
 

Most frequent Other Symbol characters

ValueCountFrequency (%) 
🦉6100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin67247288.8%
 
Common8456111.2%
 
Greek32< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
t8698412.9%
 
r8126012.1%
 
e7770311.6%
 
i6953010.3%
 
o541568.1%
 
T438386.5%
 
w435516.5%
 
p293284.4%
 
n271134.0%
 
f268584.0%
 
A264533.9%
 
d253883.8%
 
P148302.2%
 
b147262.2%
 
W146472.2%
 
h137382.0%
 
a37610.6%
 
c33730.5%
 
k23440.3%
 
D21880.3%
 
s19130.3%
 
I14390.2%
 
u12670.2%
 
m9560.1%
 
l9430.1%
 
Other values (27)41850.6%
 

Most frequent Common characters

ValueCountFrequency (%) 
8364398.9%
 
.6900.8%
 
-860.1%
 
2690.1%
 
_20< 0.1%
 
413< 0.1%
 
(7< 0.1%
 
)7< 0.1%
 
 6< 0.1%
 
🦉6< 0.1%
 
,4< 0.1%
 
:3< 0.1%
 
12< 0.1%
 
02< 0.1%
 
61< 0.1%
 
71< 0.1%
 
51< 0.1%
 

Most frequent Greek characters

ValueCountFrequency (%) 
Ο32100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII757021> 99.9%
 
None38< 0.1%
 
Latin 1 Sup6< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
t8698411.5%
 
8364311.0%
 
r8126010.7%
 
e7770310.3%
 
i695309.2%
 
o541567.2%
 
T438385.8%
 
w435515.8%
 
p293283.9%
 
n271133.6%
 
f268583.5%
 
A264533.5%
 
d253883.4%
 
P148302.0%
 
b147261.9%
 
W146471.9%
 
h137381.8%
 
a37610.5%
 
c33730.4%
 
k23440.3%
 
D21880.3%
 
s19130.3%
 
I14390.2%
 
u12670.2%
 
m9560.1%
 
Other values (42)60340.8%
 

Most frequent None characters

ValueCountFrequency (%) 
Ο3284.2%
 
🦉615.8%
 

Most frequent Latin 1 Sup characters

ValueCountFrequency (%) 
 6100.0%
 

retweets
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct239
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.084196357
Minimum0
Maximum6683
Zeros30075
Zeros (%)65.3%
Memory size360.0 KiB
2021-05-15T20:26:59.964004image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile7
Maximum6683
Range6683
Interquartile range (IQR)1

Descriptive statistics

Standard deviation44.72176481
Coefficient of variation (CV)14.50029753
Kurtosis11458.64401
Mean3.084196357
Median Absolute Deviation (MAD)0
Skewness88.08390544
Sum142055
Variance2000.036247
MonotocityNot monotonic
2021-05-15T20:27:00.303178image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
03007565.3%
 
1788717.1%
 
225535.5%
 
313372.9%
 
48061.7%
 
54781.0%
 
63680.8%
 
73000.7%
 
82450.5%
 
91820.4%
 
101490.3%
 
121420.3%
 
111180.3%
 
131090.2%
 
14900.2%
 
15870.2%
 
16680.1%
 
18510.1%
 
17500.1%
 
19410.1%
 
20370.1%
 
23320.1%
 
24290.1%
 
25290.1%
 
27270.1%
 
Other values (214)7691.7%
 
ValueCountFrequency (%) 
03007565.3%
 
1788717.1%
 
225535.5%
 
313372.9%
 
48061.7%
 
54781.0%
 
63680.8%
 
73000.7%
 
82450.5%
 
91820.4%
 
ValueCountFrequency (%) 
66831< 0.1%
 
23601< 0.1%
 
22471< 0.1%
 
20951< 0.1%
 
19802< 0.1%
 
15151< 0.1%
 
12811< 0.1%
 
9381< 0.1%
 
9221< 0.1%
 
8701< 0.1%
 

favorites
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct503
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.52170043
Minimum0
Maximum22815
Zeros19255
Zeros (%)41.8%
Memory size360.0 KiB
2021-05-15T20:27:00.660036image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q34
95-th percentile31
Maximum22815
Range22815
Interquartile range (IQR)4

Descriptive statistics

Standard deviation191.984916
Coefficient of variation (CV)14.19828202
Kurtosis6283.978438
Mean13.52170043
Median Absolute Deviation (MAD)1
Skewness66.48283436
Sum622796
Variance36858.20798
MonotocityNot monotonic
2021-05-15T20:27:01.030870image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
01925541.8%
 
1847818.4%
 
241969.1%
 
325035.4%
 
416793.6%
 
511892.6%
 
69742.1%
 
77221.6%
 
85931.3%
 
94691.0%
 
113960.9%
 
103760.8%
 
123230.7%
 
132770.6%
 
142410.5%
 
152240.5%
 
162240.5%
 
171870.4%
 
181690.4%
 
201480.3%
 
211350.3%
 
191330.3%
 
231170.3%
 
221080.2%
 
26980.2%
 
Other values (478)28456.2%
 
ValueCountFrequency (%) 
01925541.8%
 
1847818.4%
 
241969.1%
 
325035.4%
 
416793.6%
 
511892.6%
 
69742.1%
 
77221.6%
 
85931.3%
 
94691.0%
 
ValueCountFrequency (%) 
228151< 0.1%
 
174321< 0.1%
 
94581< 0.1%
 
84701< 0.1%
 
81531< 0.1%
 
80981< 0.1%
 
66511< 0.1%
 
61631< 0.1%
 
58271< 0.1%
 
55751< 0.1%
 

is_retweet
Boolean

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size45.1 KiB
False
46059 
ValueCountFrequency (%) 
False46059100.0%
 
2021-05-15T20:27:01.263199image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Interactions

2021-05-15T20:26:31.094318image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:31.439222image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:31.787136image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:32.324041image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:32.610092image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:32.979574image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:33.299964image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:33.662149image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:34.036388image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:34.459899image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:34.898006image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:35.345233image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:35.699969image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:36.148124image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:36.598899image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:37.002194image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:37.398274image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:37.778284image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:38.150845image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:38.513095image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:38.909269image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:39.268876image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:39.648336image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:40.010092image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:40.408359image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:40.833047image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:41.462994image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:41.828959image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:42.159277image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:42.514073image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:42.876003image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:43.201923image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:43.591157image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:44.223241image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:44.476203image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:44.813030image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Correlations

2021-05-15T20:27:01.473801image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-05-15T20:27:01.935843image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-05-15T20:27:02.548362image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-05-15T20:27:02.995945image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2021-05-15T20:26:45.507501image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:46.224207image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:47.159049image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2021-05-15T20:26:47.649898image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Sample

First rows

iduser_nameuser_locationuser_descriptionuser_createduser_followersuser_friendsuser_favouritesuser_verifieddatetexthashtagssourceretweetsfavoritesis_retweet
01340539111971516416Rachel RohLa Crescenta-Montrose, CAAggregator of Asian American news; scanning diverse sources 24/7/365. RT's, Follows and 'Likes' will fuel me 👩‍💻2009-04-08 17:52:4640516923247False2020-12-20 06:06:44Same folks said daikon paste could treat a cytokine storm #PfizerBioNTech https://t.co/xeHhIMg1kF['PfizerBioNTech']Twitter for Android00False
11338158543359250433Albert FongSan Francisco, CAMarketing dude, tech geek, heavy metal & '80s music junkie. Fascinated by meteorology and all things in the cloud. Opinions are my own.2009-09-21 15:27:30834666178False2020-12-13 16:27:13While the world has been on the wrong side of history this year, hopefully, the biggest vaccination effort we've ev… https://t.co/dlCHrZjkhmNaNTwitter Web App11False
21337858199140118533eli🇱🇹🇪🇺👌Your Bedheil, hydra 🖐☺2020-06-25 23:30:281088155False2020-12-12 20:33:45#coronavirus #SputnikV #AstraZeneca #PfizerBioNTech #Moderna #Covid_19 Russian vaccine is created to last 2-4 years… https://t.co/ieYlCKBr8P['coronavirus', 'SputnikV', 'AstraZeneca', 'PfizerBioNTech', 'Moderna', 'Covid_19']Twitter for Android00False
31337855739918835717Charles AdlerVancouver, BC - CanadaHosting "CharlesAdlerTonight" Global News Radio Network. Weeknights 7 Pacific-10 Eastern - Email comments/ideas to charles@charlesadlertonight.ca2008-09-10 11:28:5349165393321853True2020-12-12 20:23:59Facts are immutable, Senator, even when you're not ethically sturdy enough to acknowledge them. (1) You were born i… https://t.co/jqgV18kch4NaNTwitter Web App4462129False
41337854064604966912Citizen News ChannelNaNCitizen News Channel bringing you an alternative news source from citizen journalists that haven't sold out. Real news & real views2020-04-23 17:58:421525801473False2020-12-12 20:17:19Explain to me again why we need a vaccine @BorisJohnson @MattHancock #whereareallthesickpeople #PfizerBioNTech… https://t.co/KxbSRoBEHq['whereareallthesickpeople', 'PfizerBioNTech']Twitter for iPhone00False
51337852648389832708DeeBirmingham, EnglandGastroenterology trainee, Clinical Research Fellow in IBD, mother to human and fur baby, Canadian in Britain2020-01-26 21:43:12105108106False2020-12-12 20:11:42Does anyone have any useful advice/guidance for whether the COVID vaccine is safe whilst breastfeeding?… https://t.co/EifsyQoeKNNaNTwitter for iPhone00False
61337851215875608579Gunther FehlingerAustria, Ukraine and KosovoEnd North Stream 2 now - the pipeline of corruption, funding Russias war against Ukraine,Georgia, Syria and political intervention in USA and EU must be stopped2013-06-10 17:49:222731500169344False2020-12-12 20:06:00it is a bit sad to claim the fame for success of #vaccination on patriotic competition between USA, Canada, UK and… https://t.co/IfMrAyGyTP['vaccination']Twitter Web App04False
71337850832256176136Dr.Krutika KuppalliNaNID, Global Health, VHF, Pandemic Prep, Emerging Infections, & Health Policy MD| U.S. Congress COVID-19 expert witness x 2 | ELBI 2020 @JHSPH_CHS2019-03-25 04:14:29219245937815True2020-12-12 20:04:29There have not been many bright days in 2020 but here are some of the best \n1. #BidenHarris winning #Election2020… https://t.co/77u4f8XXfx['BidenHarris', 'Election2020']Twitter for iPhone222False
81337850023531347969Erin DespasNaNDesigning&selling on Teespring. Like 90s Disney tv movies, old school WWE. Dislikes Intolerance, hate, bigots and snakes https://t.co/fa5n4gEHgR2009-10-30 17:53:5488715159639False2020-12-12 20:01:16Covid vaccine; You getting it?\n\n #CovidVaccine #covid19 #PfizerBioNTech #Moderna['CovidVaccine', 'covid19', 'PfizerBioNTech', 'Moderna']Twitter Web App21False
91337842295857623042Ch.Amjad AliIslamabad#ProudPakistani #LovePakArmy #PMIK @insafianspower1\n#PoliticalScience #InternationalAffairs \n#PAKUSTV #Newyork #Islamabad2012-11-12 04:18:12671236820469False2020-12-12 19:30:33#CovidVaccine \n\nStates will start getting #COVID19Vaccine Monday, #US says \n#pakustv #NYC #Healthcare #GlobalGoals… https://t.co/MksOvBvs5w['CovidVaccine', 'COVID19Vaccine', 'US', 'pakustv', 'NYC', 'Healthcare', 'GlobalGoals']Twitter Web App00False

Last rows

iduser_nameuser_locationuser_descriptionuser_createduser_followersuser_friendsuser_favouritesuser_verifieddatetexthashtagssourceretweetsfavoritesis_retweet
460491376099384056766465mmnjug™KenyaFrom where I sit: There is a certain majesty in simplicity.2009-04-08 07:48:104224817922658False2021-03-28 09:10:34For #SputnikV, its storage temperature is -18°C Its two doses are given 21 days apart, unlike the eight weeks apart… https://t.co/1LkJT4lGPR['SputnikV']Twitter for Android00False
460501376099072474546176mmnjug™KenyaFrom where I sit: There is a certain majesty in simplicity.2009-04-08 07:48:104224817922658False2021-03-28 09:09:20#SputnikV coronavirus vaccine offers around 92% protection against Covid-19, according to late-stage trial results published in The Lancet.['SputnikV']Twitter for Android00False
460511376096823102865412Passionate PanafricanistNaNThe greatest war, the #Negro; Africa faces is the #ideological WAR. When WE OVERCOME IT, we WILL CONQUER THE #WORLD.2021-01-12 13:42:331298492505False2021-03-28 09:00:23@Consumers_Kenya @MOH_Kenya #SputnikV has good ratings globally and FRANKLY MAYBE ITS TIME TO TRUST WORKING WITH… https://t.co/fs4F23CmYr['SputnikV']Twitter for Android00False
460521376094143399796736mmnjug™KenyaFrom where I sit: There is a certain majesty in simplicity.2009-04-08 07:48:104224817922658False2021-03-28 08:49:45Questions have emerged on who, between the Pharmacy &amp; Poisons Board and its parent @MOH_Kenya, is telling the truth… https://t.co/hw2xSPjFjuNaNTwitter for Android64False
460531376084087312637963Douglas HerbertParisParis-based commentator at @France24. Also an avid tweeter of historical photos and artworks that catch my fancy. Instagram: @dougherbertf242009-02-27 17:18:30926524131529True2021-03-28 08:09:47Universal access: Some shopping malls in the Urals capital city of #Yekaterinburg are offering the #SputnikV vaccin… https://t.co/z9zgK4lkG1['Yekaterinburg', 'SputnikV']Twitter for iPhone06False
460541376080077054746624Consumer GrassrootsKenyaOfficial Account for Consumer Grassroots Association (CGA). We Empower Consumers to Protect Themselves. Stay informed to stay safe. Updates https://t.co/AflCO8VWrU2016-11-30 08:14:192302144719882True2021-03-28 07:53:51Russian Covid-19 vaccine #SputnikV now in Kenya. On 25th March 2021, @MOH_Kenya warned Kenyans against taking the v… https://t.co/Lgu8YSOrjJ['SputnikV']Twitter Web App12False
460551376073682381107201Michael MuchiriNairobiCivil Engineer working in Kenya. With a Passion for development and maintenance of infrastructure.2011-01-31 17:39:38159535156795False2021-03-28 07:28:26Communique on COVID19 Lockdown Effects on USIU University Programmes this Semester.\n@ExperienceUSIU @USIUAlumni… https://t.co/qr3eiQ8xgrNaNTwitter for Android00False
460561376068500360470529Michael MuchiriNairobiCivil Engineer working in Kenya. With a Passion for development and maintenance of infrastructure.2011-01-31 17:39:38159535156795False2021-03-28 07:07:51Mask is worn on the face, for protection, disguise, performance, or entertainment; ceremonial &amp; practical purposes,… https://t.co/UPP1isIFLxNaNTwitter for Android00False
460571376058766454624261Stankevicius InternationalDublin, IrelandProfessional trading consultant specializing in contracting and due diligence with a strong pres­ence and network in international markets.2020-06-30 12:31:421630False2021-03-28 06:29:10Selling: #NitrileGloves, #1860 #FaceMasks, #Vaccines #SputnikV, #syringes. Contact sales: https://t.co/gWmRopLARO o… https://t.co/iRFg3e9lRc['NitrileGloves', 'FaceMasks', 'Vaccines', 'SputnikV', 'syringes']IFTTT01False
460581376057426793861122Firras JabarIraqNaN2018-10-29 13:16:05452054492False2021-03-28 06:23:51#Novartis. #Pfizer #vaccine #VaccinePassports #Sinopharm #astrazenecavaccine #SputnikV #COVID19 @save_children… https://t.co/pcLE5yU6IF['Novartis', 'Pfizer', 'vaccine', 'VaccinePassports', 'Sinopharm', 'astrazenecavaccine', 'SputnikV', 'COVID19']Twitter for Android00False